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canceled. Appeal is taken from the rejection of all pending claims. 

Status of Amendments 

Amendments made in Applicant's First Amendment After Final, mailed on December 27, 
2004, have been entered for purposes of appeal, and are reflected in the Claims Appendix 
attached hereto. 

Summary of Claimed Subject Matter 

The invention is a new process for constructing biologically active ligands with improved 
specificity and binding affinity for biological targets, by starting with a hgand that is known to 
bind a biological target, identifying a second ligand that binds to the biological target, and 
constructing a composite ligand from the two ligands in which the distance between the two 
ligands in the composite ligand, and the orientation of the ligands to one another in the 
composite ligand, closely mirrors their relative distance and orientation when bound to the 
substrate as separate ligands. 

In a principal embodiment, described in the specification as the "Direct Method" on 
pages 9:24-10:14, the process is carried out according to the following steps: 

1 . paramagnetically labeling a ligand that is known to bind to a biological target; 

2. preparing a complex between the paramagnetically labeled ligand and the biological 
target; 

3 . preparing NMR spectra of the complex; 

4. modifying the complex by the addition of a second ligand that binds to the biological 
target; 

5. preparing NMR spectra of the modified complex; and 

6. analyzing the spectra to determine whether the second ligand has bound to the 
biological target within the paramagnetic zone of the label on the first ligand; 

7. further analyzing the spectra to determine the distance of separation between the first 
and second ligand when bound to the biological target; 

8. deducing the relative orientation of the first and second ligands when boimd to the 
biological target; and 

9. linking the two ligands together at the distance of separation and orientation deduced 
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in steps 8 and 9. 

The combined ligands have a greater specificity and affinity for the biological target than either 
of the ligands by itself 

Independent Claim 1 — Independent claim 1 covers the "Indirect Method" described 
above and in the specification at 9:24-10:14, but broadens it sUghtly to cover the several ways 
that the NMR spectra can be generated for the biological target and the second ligand. For 
example, the spectra can be generated from a complex formed between the biological target and 
the second ligand, and compared to the spectra from the complex formed between the biological 
target and the first ligand, or spectra can be generated fi*om a tri-part complex formed between 
the biological target, the second ligand and the first ligand, and compared to the spectra fi"om the 
complex formed between the biological target and the first ligand. Support for this expansion is 
contained in the specification at 10:26-10:30. 

Independent Claim 19 ~ Independent claim 19 covers a variation of the foregoing 
process, referred to in the specification as the "Direct Method," in which NMR spectra are also 
prepared of the second ligand and a paramagnetically labeled derivative of the first ligand in the 
absence of the biological target, and compared to spectra derived from a tri-part complex formed 
between the biological target, the second ligand and the first ligand, and spectra derived from a 
complex of the biological target and only one of the ligands, to deduce the relative distances of 
the ligands when bound to the biological target. This method is described in the specification at 
11:1-11:18. 

Grounds of Rejection to be Reviewed on Appeal 

The following issues are presented on appeal; 

(1) whether the preamble in claims 1 and 19 is imclear because it recites "a method for 
improving the binding affinity of ligand for a biological target," when the claim does not 
specifically recite an improvement in the binding affinity. 

(2) whether step (c) of claim 20 is rendered unclear by the presence of the term 
"substantially." 

(3) whether the claims are properly rejected as obvious over the combined teachings of 
Johnson et al. (1999) and Bolon et al. (1999). 
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Argument 

Claim Rejections - 35 U^S.C, §112, second paragraph 

The Final Office Action rejects claims 1 and 19 because they recite a method for 
improving the binding affinity of a ligand for a target and do not provide a step of improving the 
binding affinity. This rejection is improper because (1) the objectionable limitation is found in 
the preamble, which does not even limit the claim, and (2) the binding affinity is inherently 
improved in a significant number of constructs whenever one carries out the recited sequence of 
steps. 

The Final Office Action rejects claim 20 because clause (c) contains the term 
"substantially." Applicant respectfiilly submits that a skilled worker would understand precisely 
what is meant by the term substantially, since it refers to a distance and orientation that improves 
the binding affinity of the resulting compound. 

Claim Rejections - 35 U.S.C. § 103 

The Final Office Action also rejects the claims as obvious based on the combined 
teachings of Johnson (1999) and Bolon (1999). During a telephone interview conducted on 
March 15, 2005, the Examiner informed Apphcant's attorney that she was relying on page 620, 
in column 2 of Johnson, which describes composite protein that Johnson constructed by linking 
together two cellulose binding domains (CBDni and CBDn?) of the CenC beta-l,4-glucanase 
enzyme. This linking of two protein domains is in contrast to the linking of two parts of a ligand 
that binds to a protein as described in the instant claims. Johnson was studying the composite 
protein with the two CBDs to gain some understanding of the binding characteristics of the 
CBDs to cellulose, but ultimately he concluded that the relative orientation of the CBD protein 
domains did not even matter, because "the tandem CBDs anchor CenC to its natural substrate . . . 
without a strong preference for their orientations." (P. 620 at col. 2.) 

Johnson did not link the CBD domains of the composite protein at a preferred orientation 
and distance to improve their binding affinity for a ligand, or undertake any steps to characterize 
that preferred distance and orientation, all as required by the claims of the present invention. He 
does not identify the structure of the linker, the distance between the two CBDs when bound to 
the linker, or the relative orientation of the two CBDs when boimd to the linker. In fact, he states 
just the opposite in column 2 of page 620 when he states that "the spatial arrangement of these 
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modules (i.e. the two CBD ligands) within native C.Jimi CenC remains to be defined." 
Continuing in the second column of page 620, Johnson further confirms the point when he states 
that the two ligands "cannot bind simultaneously to adjacent regions of a single polymer chain" 
due to "structural constraints" in the linker. 

Johnson never addressed the issue of a distance between a first and second ligand and 
never "determine[ed] whether a second ligand perturbs peaks on the second NMR spectra that 
are also perturbed by the paramagnetic label on the first NMR spectra," as required by clause (d) 
of claim 1 . That step would only be performed if one was interested in determining whether the 
two ligands bound to the target molecule at an ascertainable distance, and constructing a 
composite ligand at the distance ascertained. Moreover, no orientational data fi-om dipolar 
couplings were collected as described in claims i.e. 7, 19(e) and 20(b). The "orientation" of the 
cellulose-derived ligand recited by Johnson actually refers to a "direction" of binding in the 
protein site which proves to be ambiguous, as opposed to a three dimensional orientation. This 
latter information was also dependent on use of assigned protein resonances, a step not needed in 
the present invention. 

To overcome these deficiencies, the Examiner makes reference to the fact that Johnson 
reports some spin label studies that he conducted in column 1 of page 620. These studies were 
performed to locate the binding site of a single ligand on the protein surface, not to define 
distances between two ligands; again it required assignment of protein resonances. Once again, 
this represents a fundamental difference between the method Johnson was employing and the 
method described in the instant claims. 

The Examiner also cites to Bolon (1999) to cure these deficiencies, but Bolon merely 
reports the availability of residual dipolar couplings to deduce the relative orientation of a ligand 
when bound to a biological target based upon observations of residual dipolar couplings. The 
article only analyzed the orientation of a single ligand relative to a protein structure. The authors 
did not bind two different ligands to the protein structure — as required by the pending claims — , 
deduce the relative orientations of the two ligands - as required by the pending claims — or 
determine the distance between the ligands - again as required by the pending claims, because he 
was not concemed with improving the binding affinity of a ligand for the biological target by 
making composite ligands. In short, the reference did not offer any suggestion or motivation for 
constructing composite ligands based upon their relative distances and orientations when bound 
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to a protein, as recited in the pending claim. Lacking such a suggestion or other motivation, the 
references do not support a prima facie case of obviousness. 



Li view of the arguments presented herein. Applicants respectfully request that the final 
rejection in this matter be vacated, and that this application be returned to the examiner with 
instructions to enter a notice of allowance. 



March 22, 2006 

KING & SPALDING LLP 

191 Peachtree Street 

Atlanta, GA 30303 

(404) 572-4600 (Telephone) 

(404) 572-5145 (Facsimile) 

K&S Docket No. 04342.105053 



CONCLUSION 



Respectfully submitted 




Clark G. Sullivan 
Reg. No. 36,942 
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APPENDIX 1 
CLAIMS APPENDIX 

1) A method for improving the binding affinity of a Ugand for a biological target comprising: 

a) preparing first NMR spectra of a first complex comprising the biological target and a 
paramagnetically labeled derivative of a first ligand to define a paramagnetic zone; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and a second ligand; 

c) analyzing the spectra to determine whether the second ligand binds to the biological 
target within the paramagnetic zone of the paramagnetically labeled derivative; 

d) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 

e) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

f) selecting or preparing a compound that contains the first and second ligands 
substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 

2) The method of claim 1 wherein step (c) is performed by: 

a) identifying peaks on the first NMR spectra that are perturbed by the paramagnetic 
label; and 

b) determining whether the second ligand perturbs peaks on the second NMR spectra that 
are also perturbed by the paramagnetic label. 

3) The method of claim 1 wherein the first complex further comprises the second ligand, and 
step (c) is performed by determining whether the paramagnetically labeled derivative of the 
first ligand perturbs peaks associated with the second ligand. 

4) The method of claim 3 further comprising, before step (c), preparing third NMR spectra of 
a mixture of the paramagnetically labeled derivative of the first ligand and the second 
Ugand in the absence of the biological target. 

6) The method of claim 1 wherein the distance of separation is determined as a function of the 
loss of intensity for NMR resonances from the second ligand. 
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7) The method of claim 1 wherein the three dimensional orientation is deduced by producing 
a field ordered state in a medium comprising the biological target, the first ligand, and the 
second ligand, and analyzing dipolar couplings within the first and second ligands. 

8) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of lipid bicelles having complementary charges to the biological target. 

9) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of bacteriophage having a domain of the biological target in the outer coat. 

10) The method of claim 1 wherein the paramagnetic label is a nitroxide or metal chelate. 

11) The method of claim 1 wherein the first and second NMR spectra are two dimensional 
heteronuclear single quantum coherence spectra. 

12) The method of claim 1 wherein the biological target is isotopically labeled. 

13) The method of claim 1 wherein the biological target is a protein, and NMR resonances fi*om 
the protein are not assigned to a sequence of the protein. 

14) The method of claim 1 wherein the biological target is a protein, and the three dimensional 
conformation of the protein is unknown. 

15) The method of claim 1 wherein the first ligand is an oligosaccharide, and the biological 
target is a protein. 

1 6) The method of claim 1 wherein the second complex comprises the first ligand, or a 
paramagnetically labeled derivative thereof 

19) A method for improving the binding affinity of ligands for biological targets comprising: 

a) preparing first NMR spectra of a first complex comprising a biological target, a 
paramagnetically labeled derivative of a first ligand, and a second ligand; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and either the second ligand or the paramagnetically labeled derivative of the first 
ligand; 

c) preparing third NMR spectra of a mixture of the paramagnetically labeled derivative 
of the first ligand and the second ligand in the absence of the biological target; 

d) analyzing the spectra to determine whether the paramagnetically labeled derivative of 
the first ligand perturbs peaks associated with the second ligand; 

e) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 
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f) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

g) selecting or preparing a compound that contains the first and second ligands 
substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 

20) The method of claim 19 further comprising: 

a) deducing from the NMR spectra the distance between the first and second ligands 
when bound to the biological target; 

b) deducing fi-om the NMR spectra the relative three dimensional orientation of the first 
and second ligand when bound to the biological target; and 

c) selecting or preparing a hybrid ligand that contains the first and second ligands 
covalently linked substantially at the bond distance and relative orientation deduced in 
steps (a) and (b). 
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APPENDIX 2 
EVIDENCE APPENDIX 

Attached hereto are copies of the evidence entered by the Examiner and relied upon by 
appellant in the appeal. This evidence was cited by the Examiner at the following points during 
prosecution of this application: 

Johnson et al., J. Mol. Biol. (1999) 287:609-625, cited by the Examiner in Office Actions 
mailed February 25, 2004 and October 26, 2004. 

Bolon et al., J. Mol. Biol. (1999) 293:107-1 15, cited by the Examiner in an Office Action 
mailed October 26, 2004. 
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The Cellulose-binding Domains from Cellulomonas 
fimi p-1,4-Glucanase CenC Bind Nitroxide Spin-labeled 
Cellooligosaccharides in f\/lultiple Orientations 

Philip E. Johnson, Emmanuel Brun, Lloyd F. MacKenzie, 
Stephen G. Withers and Lawrence P. Mcintosh* 

The N-tenninal cellulose-binding domains CBD^i and CBDn2 from 
Cellulomonas firm cellulase CenC each adopt a jelly-roll p-sandwich struc- 
ture with a deft into which amorphous cellulose and soluble cellooligo- 
saccharides bind. To determine the orientation of the sugar chain wi^iin 
these binding clefts, the association of TEMPO (2Z6,6-tetramethylpiperi- 
dine-l-oxyl-4-yl) spin-labeled derivatives of cellotriose and cellotetraose 
with isolated CBDnj and CBDpc was studied using heteronuclear ^H-^N 
NMR spectroscopy. Quantitative binding measurements indicate that the 
TEMPO moiety does not significantly perturb the affinity of the celloo- 
ligo-saccharide derivatives for the CBDs. The paramagnetic enhance- 
ments of the amide ^H^ longitudinal (ARi) and transverse (AKj) 
relaxation rates were measured by comparing the effects of TEMPO<ello- 
tetraose in its nitroxide (oxidized) and hydroxylamine (reduced) forms 
on the two CBDs, The bound spin-label affects most significantly the 
relaxation rates of amides located at both ends of the sugar-binding deft 
of each CBD. Similar results are observed with TEMPO-cellotriose bound 
to CBDnj. TTiis demonstrates that the TEMPO-labeled cellooligosacchar- 
ides, and by inference strands of amorphous cellulose, can associate with 
CBDmi and' CBDpc in either orientation across their ^heet binding clefts. 
The ratio of the assodation constants for birkling in each of th^ two 
orientations is estimated to be within a factor of five to tenfold. This find- 
ing is consistent with the approximate symmetry of the hydrogerv-bond- 
ing groups on both the cellooligosaccharides and the residues forming 
the binding clefts of the CenC CB£)s. 

C 1999 Academic Press 

Keywords: proteinorbohydrate interactiorv NMR; spin label; 
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intrnrliirtirm i™' cdlulose-binding domains, CBD^^ and 

iniroauciiuii CBD^c(ToI^me et al., 1995). In contrast to most 

The B-l,4-glucanase CenC from Cellulomonas fimi CBDs which bind crystalline cellulose, the homolo- 

is modular in structure and hinction (Coutinho gous family IV CBD^i and CBD^j from CenC are 

€i a/., 1992, 1991)- Its central catalytic domain is unique in tiieir specifidty for amorphous cellulose 

competed by a short linker to two tandem N-term- and soluble cellooligosaccharides. As part of a col- 

* 

Presoit address: P. E. Johnson, Howard Hughes Medical Institute, Departmeit of Chemistry and Biochemistry, 
University of Maryland Baltimore County, Baltimore, MD 21250, USA. 

Abbreviations used: CBD, cdlulose-binding domain; CBDc«/ the cdlulose-binding domain from die mixed 
cellulase/xylanase Cdluiomonas fimi Cex; CBDni, the N-tenninal cellulose-binding domain from Cellulomonas fimi 
H 4^1ucanase CenC; CBE^, the cdlulose-binding domain from Cellulomonas fimi p-l,4-glucanase CenC following 
CBEX^ in sequence; CBI>nim2, the tandem cdlulose4>inding domains from Cellulomonas fimi p-l,4-glucanase CenC; 
HSOC heteronuclear single quantum correlation; NOE, nudear Overhauser effect; pH*, ttie observed pH meter 
reading without correction for isotope effects; TEMPO, 2,2,6,6-tetramethvlpiperidine-l-oxyl-4-yl; TENfPOGlCa, 
TEMPOlabeled cellotriose; TEMPO-GIC4, TEMPOlabeled cellotetraose. 
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laborative effort to exploit these domains for use in 
biotechnology, we have used NMR spectroscopy 
and calorimetry to provide a structural and 
thermodynamic foundation for understanding the 
distinct binding properties of CED^^ and CBDnj 
(Creagh et aU 1998; Johnson et aL, 1996a,b, 1998; 
Tomme et al, 1996a). 

CBDni Qohnson et al„ 1996b) and CBDfc 
are composed of two five-stranded antiparallel 
(E.B., P£J. & L.P.M., unpublished results) ^-sheets 
that fold into very similar jelly-roU sandwich struc- 
tuies. As demonstrated by NMR chemical shift 
perturbations and the observation of intermolecu- 
far protein-sugar NOEs, a single cello- 
oligosaccharide molecule biiuls to these CBE>s 
within a groove or cleft that runs across one 
p-sheet face of each protein domain (Johnson et ai, 
1996a,b). The presence of a cleft readily explains 
the binding spedfidty of these CBDs. Soluble cel- 
looligosac(±iarides and single polysaccharide 
chains, as might be encountered in me amorphous 
regions of cellulose, bind within this cleft, whereas 
flat crystalline arrays of cellulose are exduded. 
In agreement with the observation that the affinity 
of CBDm] for cellooligosaccharides increases in 
the order cellotriose < ^otetraose < cellopentaose 
^ cellohexaose, the sugar binding sites in me CenC 
CBDs can each be spanned by approximately five 
glycosyl units Qohnson et al,, 1996a^; Tomme et al., 
1996a). 

In parallel with these structural studies, a 
detailed calorimetric analysis of the association of 
CBD|sji with polysaccharides was also conducted 
(Creach et ai, 1998; Tomme et aL, 1996a). CBDnj 
was found to bind regenerated (phosphoric acid 
swollen) cellulose and other soluble ^1,4-linked 
polymers of glucose, such as hydroxyethyl cellu- 
lose aiul barley and oat-^-glucan, with affinities 
equal to those measured for cellopentaose aiul 
cellohexaose. Formation of the CBD^i^-sugar com- 
plexes is favored enthalpically, indicating that 
hydrogen bonding and van der Waals interactions 
provide the primary driving force for the binding 
event. Inspection of the structures of CBD^i and 
CBDm2 reveals that their binding defts consist of a 
central strip of hydrophobic side<hains, flanked 
on both sides by polar residues. This led us to 
propose that the pyranose rings of cellooligosacch- 
harides, and by inference single polysaccharide 



chains in regions of amorphous cellulose/ are 
stacked: against the hydrophobic strip, while the 
flanking hydrophilic residues provide hydrogen 
bonds to the equatorial hydroxy groups of the 
sugar Oo^<^ ^ 1996a,b). Such interactions 
are often observed with carbohydrate47inding pro- 
teins (Quiocho, 1989, 1993; Vyas, 1991). 

The structures of CBD^i and CBDm2 vveie calcu- 
lated using NMR data collected for these protein 
domains in the presence of saturating coiKen- 
trations of cellotetraose or cellopentaose,. 
respectively (Johnson et al., 1996b). Numerous 
intermolecular nudear Overhauser effects (NOEs) 
between the ^^/^^N-labded protons and Ae 
unlabeled cellooligosaccharides were observed in 
isotope edited /filtered NOESY expaiments 
(Otting & Wiithrich, 1990). However, the NMR 
spectra of the cellooligosacdiarides are highly 
degenerate, thus preventing the unambiguous 
assignment of these NOEs to specific interactior s 
between sugar and protein protons. This preduded 
the direct determination of the structures of the 
CBI>-celluoligosaccharide complexes. 

As a first step towards achieving this goal, wc 
have tocussed on defiiung the orientation of a cel- 
lulose diain within the binding defts of CBDni 
and CBDn2- Exploiting a novel ^ycosynthase tech- 
nology (Mackenzie et al,, 1998), we have prepared 
derivatives of cellotriose and cdlotetraose with a 
2,2,6,6-tetrameUiylpiperidine-l-oxy-4-yl (TEMPO) 
spin-label covalently attached io ^ reducing end 
of the sugar (Pigtire 1). The use of such paramag- 
netic relaxation probes has a long history in the 
study of biologiral macromolecules by NMR spec- 
troscopy (Kosen, 1989). The nitroxide moiety cose 
tains an unpaired dectron that provides an 
effident mechanism for the relaxation of neighbo''- 
ing nudei via dipolar coupling. EXie to the magni- 
tude of the electron magnetic moment, this 
interaction extends to over 20 A. In contrast, pro- 
ton-proton NOEs are limited to sq>arations of less 
than ~5 A. Thus, these TEMPO-labeled sugars can 
be utilized to obtain long-range distance infor- 
mation about the CBD-cellooligosaccharide com- 
plexes, "in practice, the excellent dispersion of ^^ttie- 
signals-^rom the amide ^H-^N groups, combined 
with the high degree of sensitivity of the HSQ<Z 
experiment (Cavanagh et al,, 1996), allows the 
effect of the spin-labd cellooligosaccharides on 
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Figure 1. Chemical structures 
of the spin-labeled glycosides: (a) 
TEMFO-Glca and (b) TEMPO-Cilc4. 
Reduction of the paramagnetic 
nitroxide (N-O) moiety produces 
\he diamagnetic hydroxylamine 
(N-OH) derivatives of these 
glycosides. 



each non-proline residue in ^N-labeled CenC 
CBDs to be evaluated. Similar studies using spin- 
labeled ligands and heteronudear NMR methods 
have been reported by several groups (Gellespie & 
Shortle, 1997a,b; Zhao et oi., 1997; Kleerekopa- et a/., 
1995; Yu et al., 1994). Using this approach, we 
demonstrate that the modified su^rs bind to 
CBD^i and CBDn2 ^ t>oth possible orientations 
across thdr ^heet binding clefts. TWs result is 
consistent with the approximate symmetry of the 
cellooligosaccharides and the positions of ttie side- 
chains forming the binding clefts of the two protein 
domains, as well as relaxation measurements 
which indicate that residues within the binding 
cleft of CBDisn remain conformationally flexible in 
the presence of these sugars (P.EJ,, E.B. & LP-NL, 
unpublished results). 

Results 

Preparation ol TEMPO- 
iabeled cellooligosaccharides 

Syntheses of the nitroxide spin-labeled cellotrio- 
side (TEMPO-Glc?) and cellotetraoside (TEMPO 
GIC4) were accomplished in a two-step chemo- 
enzymatic process, thereby avoiding many of the 
difficulties inherent in the preparation of oligosac- 
charide derivatives by dusmical methods alone. 
A TEMPO-labeled glucoside was first synthesized 
via standard Koenigs-Knorr methodolqgy, essen- 
tially according to a published procedure 
(Gnewuch & Sosnovsky, 1986; Hessas & Goldstein, 
1981). This product was characterized by mass 
and, to a limited extmt, 'H-NMR spectroscopy. 
Full NMR spectroscopic charactmzation was per- 
formed on the catalytically hydrogenated (reduced) 
and acetylated hydroxylamine derivative that no 
longer bears a free radical. Conversion of the 
TEMPO-glucoside to TEMPOGIC3 and GIC4 was 
accon^lished by successive transfer of glucose 
residues from a-glucosyl fluoride using a mutant 
Agrobacterium sp. p-glucosidase (glycosynthase) in 
which the catalytic nudeophile, Glu358, was 
replaced by an alanine residue (Mackenzie et ai, 
1998). This approach exploits the well-known 
transglycosylation reaction catalyzed by retaining 
glycosidases, yet has the advantage that the 
mutant enzyme cannot hydrolyze the end pro- 
ducts, thereby resulting in good yields of --70% 
per coupling. The products were purified by HPLC 
and their identities confirmed by ionspray mass 
spectrometry, and after reduction and acetylation, 
by ^H-NMR spectroscopy. 

Binding of TEMPO-labeled 
cellooligosaccharides to CBDni and CBDn2 

We have demonstrated previously, using NMR 
spectroscopy and isothermial titration calorimetrv, 
that the monomeric CBDni binds soluble cellooli- 
gosaccharides with a stoichiometry of 1:1 (Johnson 
et al., 19%a; Tomme et aL 1996a). Similar results 




Tocal Ugand Concentration (mM) 

Figure 2. The association constants of the CenC CBDs 
for TEMPO-labeled cellooligosaccharides were measured 
using *H-"N NMR spectroscopy. Shown are the nor- 
malized chemical shifts changes, or fraction boimd, of 
the amide proton of AsnSl in CBDni upon titration 
widi TEMPCM3IC3 (filled diamonds) and TEMPO-GIC4 
(filled squares), and die nitrogoi atom of Vall97 in 
CBDm2 upon addition of TEMPOGIC4 (open squares). 
The solid and broken lines represent die biest fits of the 
data to the Langmuir isotherm describing tiie binding of 
one Ugand to a single protein site on CBD^i or CBDm2, 
respectively. The total chemical shift changes (or Asn8l 
and Vall97 were extrapolated to be 0.1 ppm (*H^) and 
0.76 ppiil (^'N), respectively. 



have been obtained with CBCVq (data rK>t shown). 
Continuing this approach, the interactions of ^e 
TEMPC)-labeied cellooligosaccharides with CBD^^^ 
and CBDn2 vvere analyzed quantitatively by moni- 
toring the ^H*^ and chemical shifts of the pro- 
teins upon titration with these soluble sugars. The 
spectral changes resulting from the addition of 
TEMPQ-GIC3 and TEMPO-GIC4 are similar to those 
observed with the corresponding unlabeled celloo- 
ligosaccharides, indicating that these ligands all 
bind in the regime of fast exchange on the chemical 
shift timescale to the same regions in CBDni and 
CBD|^ and with the same 1:1 stoichiometry fdata* 
not shown). Apparent or macroscopic equilibrium 
association constants were obtained by fitting the 
titration data to the Langmuir isotherm describiitg 
the binding ot one Ugand molecule to a single pro- 
tein sit^,(Figure 2). 
TEMfO-Glc3 binds to CBDni with an apparent 
of 690(±260) M-^ This is marginaUy higher 
thian the value of 180(±60) M"^ measured pre- 
viously for unlabeled cellotriose Gohnson et aL, 
19%a). -The apparent values of TEMPO-GIC4 
for CBDni and CBD^ are 42(X}(±400) M'* and 
36(X)(±2200) M'\ respectively, while the associ- 
ation constants of CBDni and CBDn2 for unlabeled 
cellotetraose are 42(X)(±720) M^^ Qohnson aL, 
1996a) and 7(»(K)(±650) M'* (data not shown). The 
relatively lar^e errors in the values measured 
for the TEMPO-labeled cellooligosaccharides ralect 
the fact that the nitroxide efficiently broadens the 



signals from many amide groups within the bind- 
ing deft to the point that they are no longer obser- 
vable. In general, it is these amide groups that 
show the largest chemical shift changes upon 
addition of sugar to the protein and would other- 
wise be best suited for fitting to the binding iso- 
therm equation. Within the error of the data, we 
conclude that the TEMPO moiety does not signifi- 
cantly perturb the affinity of the CenC CBDs for 
these cellooligosaccharides. By way of con^arison, 
increases in affinity of '^lO to 20-fold result from 
the lengthening of cellotriose or cellotetraose by 
one additional glycosyl urut to form cellotetraose 



or ceilopentaose, respectively CI*omme et at., 
19%a). 

TEMPQrlabeled cellooligosaccharlcies bind 
CBDni :pnd CBDn2 in multiple orientations 

In acMition to perturbing the chemical shifts of 
amide groups in CBDi^n and CBDisq, the binding of 
TEMFO-GIC3 and GIC4 causes a decrease in the 
intensities of the signals from many of these 
groups due to paramagnetic relaxation enhance- 
ment Tius is readily seen by coix^>anng tite ^H-**N 
HSQC s^>ectra of two proteins complexed witt\ the 
oxidized and reduced cellotetraose derivative 
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^n of CBI>i^i and CBD^i/ respectivery- Peaks from amides that "disappear" l>eca 
die nearby unpaired dectron in the nitroxide-labeled cellotetraose are identified 
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(Figtire 3). Qualitatively, it is dear that a number 
of resonances present in the spectra of proteins 
bound to the reduced hydroxylamine form of the 
sugar are "absent" in the specta recorded with the 
oxidized nitroxide form, i.e. their intensities fall 
below a very low contour level. Thus, these amide 
groups must be dose ( ^ 11 A; vide infra) to the free 
dectron in the nitroxide moiety. 

The amide protons whose resonances are absent 
in the *H-^N HSQC spectra of the coit^lexes of 
TEMPO-GIC4 with CBDni and CBDn2 can be classi- 
fied into two groups that are located on the oppo- 
site eiKls of the binding defts of these proteins. (As 
shown in Figure 5, the biiuling deft of each CBD 
lies across an antiparaUd p-sheet A formed by five 
strands, arranged as A1-A2-A5-A3-A4.) In the case 
of CBDni, the first group indudes Vall7, Alal8, 
Tyrl9 and ay20 from p-strand Al, while the 
second group conyrises Tyr85, Qy86 and HirS? 
from p-strand A4 and the loop between A3 and 
A4. Similarly, for CBDmj/ the first group is formed 
by Serl66, Leul67, Tyrl68, Glyl69 from p-strand 
Al, while the second group indudes of Tyr234, 
Arg235 aiKi Thr236 from P-strand A4 and the pre- 
cemng loop. In contrast, peaks from amide protons 
in the central p-strand A5 of each CBD are present 
in the ^H-^N hBQC spectra. Inspection of the ter- 
tiary structures of CBDni and CBDn2 reveals that 
the outermost strands, Al and A4, are separated 
by ^^20 A. The observation that the amide protons, 
which are most strongly perturt)ed by the nitroxide 
spin labd, are located in both of these outer 
p-strands cannot he accounted for by the binding 
of TEMPO-GIC4 to the CBDs in a single orientation. 
The simplest explanation of this result is that 
TEMPO-GIC4 associates with CBDnj and CBDn2 
at least two distiiKl orientations, such that the 
nitroxide moiety lies at either edge of their sugar 
binding defts. 

The peaks from amide protons in p-strands Al 
and A4 that are absent in the spectrum of CBDn^ 
bound to TEMPCKJIC4 also disappear upon bind- 
ing TEMPOGIC3 (see Supplementary Material). In 
addition, signals from several amide protons, 
including those of ThrZl in strand Al and Ala41, 
Gln42% Tyr43, Val48, AsnSO, and GlySl in strand 
A2, are weak in the spectrum of CBDnj complexed 
with TEMPO-Glc^ arui absent in die presence of 
TEMPO-GIC3. This qualitative distinction may 
reflect a slight difference in the positions of the 
two sugars when bound to CBDni- Several pro- 
tons, induding the indole H'^ of Trpl6 and the 
amide H*^ of Leul46, which are not in the binding 
cleft of CBDni . are also affected by TEMPOGlc:^. 
This is attributed to non-specific effects arising 
from the addition of a 22-fold molar excess of the 
nitroxide-labded sugar to the protein sample. 

Measurement of Af?^ and A/is 
relaxation enhancements 

A qualitative inspection of the ^H-^^N HSQC 
spectra of the two CBD-TEMPO-GIC4 complexes 
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provides immediate insights into the binding 
orientations of the sugars. Further details regarding 
the positions of the bound rutroxide moiety can be 
gained by a quantitative analysis of the effects of 
the spin label on the rdaxation properties of the 
two CBDs. To reduce the possibility of non-spedfic 
interactions, CBDni and CBDisq wa« studied at 
saturation levels of approximatdy 80% and 65%, 
respectivdy (Figure 2). 

Hie enhancements of the longitudinal (ARO and 
transverse (AR2) relaxation rates of CBDni and 
CBD^Q due to complexation with TEMPO-GIC4 are 
presented in Figures 4 and 5. Data for CBDni 
bound to TEMPOGIC3 are provided as Supplemen- 
tary Material. The AR^ values were determined 
from the differences in the Ti lifetimes, measured 
using a ^H-^^N HSQC sequence as a readout 
of a non-selective inversion-recovery sequence 
(Cavanagh et al,, 1996), for the proteins in the pre- 
sence of the nitroxide and hydroxylamine deriva- 
tives of cdlotetraose. The AK2 values were 
obtained using two complementary methods. In 
the first, the effect of the spin labd on the proton 
transverse relaxation (AR2V01) of the ^H*^ during 
the INEPT and reverse-INEPT period of the HSQC 
pulse sequence was measured by comparing the 
total vcdume of its cross-peak in spectra recorded 
with the oxidized and reduced sugar. In the 
second;^A/^2.Lw was determined from the change 
in the proton line-width of the amide measured 
with the two forms of the modified cellotetraose. 
The methods for determining ARi arui AR2 assume 
that contributions to the peak volumes and liiie- 
widths due to scalar couplings and relaxation pro- 
cesses other than that mediated by the free radical 
remain constant, and that the oxidized and 
reduced glycosides interact identically with the 
CBDs. The lack of any significant amide ^H^ or 
^^N chemical shift perturbation upon reduction or 
the TEMPO group provides strong support for 
these assumptions (Figure 3). 

Following these approaches^ 101 ARi, 64 
AK^vd/ ai^d 99 ARj^lw values were obtained for 
CBDni ^d 113 ARi, 44 ARj^voi. and 118 A/Ch w. 
values for CBD^q (Figure 4). Rate enhancements 
were not measurable for the remaining non-prO- 
line residues for two reasons, besides spectral 
overlap. First, in the case of those amide protons 
that are closest to the nitroxide moiety, severe 
line-broadening rendered their signals undetect- 
able in the spectra recorded with oxidized 
TEMPO-GIC4- For these residues, we estimate 
AK, >2;5s"' and AKi > SO s"^ Second, in the 
case uf those amide protons furthest trom the 
nitroxide group, changes in their relaxation rates 
were top small to measure reliably. In particular, 
dilution effects due to the addition of ascorbic 
add and subsequent pH adjustments caused the 
volumes of the peaks in the ^H-*^ FISQC spec- 
tra of the proteins recorded with the reduced 
sugar to decrease slightly. This led to small 
apparent negative AR2.V01 values for many 
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Figure 5. Paramagnetic relaxation studies demonstrate that TEMFO-GIC4 binds to each CenC CBD in at least two 
orientations sudi tftat die nitroxide moiety can lie at either end of the binding deft. Shown are ribbon diagrams of 
CBDni QeSt) and CBDfga (ngbt) positioned for ttte reader to look across (top) and down onto (bottom) the cdlooligo- 
saccharide binding groove that is formed by ^-sheet A in eadi protein. The individual strands of fliis p-sheet are 
labeled. The ribbms are raiim-colored based upon AJ^2.lw values ranging from 1 Hz (lig^t yellow) to 80 Hz (dark 
orange). Residues whose ^H^ signals disappear completely in the presence of TEMPO-Glci are indicated in red 
(AK2.LW > W Hz)" ^® proline residues and other residues for which no results were obtained are shown in white. 
The gieoi and blue spheres indicate repres^tative positions of tfie nitroxide groups calculated using an ensemble- 
averaging protocol in X-PLOR widi measured r^ff as distance restraiitts. Note ^t die TEMPO-GlCi can occupy only 
one of these two orientations at any given time. 



amides whose relaxation rates were not perturbed 
sigruficantly by the spin label. 

As seen in Figure 4, AR-^ and AK^ show similar 
trends, yet the latter provide a more sensitive indi- 
cator of the effects of the spin-labeled sugars on 
CBDni and CBDn2- ^ the cases of amide protons 
with resolved ^H-^^N HSQC cross-peaks that 
experience moderate paramagnetic relaxation 
enhancements, the values of Al?2,voi and ARjxw 
agree well (ratio = 1.7(±1)). However, a more com- 
plete data set could be obtained by measuring 
changes in line-width rather than volume due to 
the above mentioned problems associated with 
dilution effects and difficulties encountered with 
partially overlapping pealcs. 

Based on these measurements, it is immediately 
apparent that amide groups in four regions of 
CBDni and four regions of CBDn2 experience 
significant paramagnetic relaxation enhancements 
due to the bound TEMPCM3IC4. These include 



Glyl5 to ThrZl, Tyr43 to AsnSl, GlnSO to Asp90 
and Glul22 to Serl33 in CBDni and GIvl60^ 
GlulTZ, Trpl92 to Asnl99, Glu229 to Glu239 and 
Gly266 to Ala279 in CBDn2- These residues lie 
within the five ^trands, Al to A5, and adjacent 
loops that torm the cellulose-binding clefts of 
each protein. Of these four regions, amide pro- 
teins in strands Al and A4 that comprise the 
outer edges of the binding face have markedly 
larger AKi and AR2 values than do those in the 
central strand A5. This verifies that TEMPOGIC4 
binds CZBDni and CBDn2 ui multiple orientations 
such that the nitroxide spin label is located at 
either end of the binding deft. In contrast, the 
relaxation rates of amide groups located on the 
opposite side of each protein relative to this ddPt 
(^trands 81 to B5) do not exhibit marked 
enhancements. Similar patterns are observed with 
TEMPOGIC3 bound to CBDni (Supplementary 
Material). 



Calculation of electron-proton distances 

Values of the correlation time for the fluctu- 
ation of the eiectronic-nudear dipole-dipole inter- 
action, wete calculated for individual residues in 
both CBDni and CBDn2 bound to TEMPO-GIC4 
using the measured ARy and AH^voi or AII2XW 
rate enhancements. In the case of CBDnj, values 
for 81 amide protons ranged from 0.1 ns to 3.6 ns, 
witti a mean of 1. 7 (±05) ns. For CBDnj, these ran- 
ged from 0.1 ns to 6.5 ns for 101 amide protons, 
with a mean of 1.9(±0J) ns (Figure 6). The corre- 
lation time Tc is dependent upon bodi the relax- 
ation of the electron and motions of the electron- 
proton vector (1/Tc = 1/xs + I/^r). In this equation, 
T5 is the longitudinal relaxation time of tiie free 
electron and Xr is the effective rotatioiul corre- 
lation time of the electron-proton vector. The latter 
is dependent on the motional characteristics of the 
protein-ligand complex. Since Tr is in the range ot 
10 to 10'" s for most proteins studied by solution 
NMR methods, and Xg is typically longer than 10"^ 
s for nitroxide radicals, x^ is essentially equal to xr 
(Kosen, 1989). Tlie correlation times for the global 
tumbling of CBDni and CBDn2 saturated wiSi cel- 
lopentaose are approximately 7.4 ns, as determined 
by ^^N relaxation methods (P.EJ., E.B. & LP At, 
ui^ublished results). The shorter correlation times 
extracted from the paramagnetic relaxation 
erthancements may reflect conformational mobility 
of the bound TEMPO moiety. The large degree of 
variability in the Xc values measured for the amide 
protons in CBDni and CBD^q is attributed to comr 



pounding errors from the ARi and AR2 measure- 
ments, as well as different orientation-da>endent 
effects of internal mobility on the distance between 
the free electron and tiie ^H^ of individual amide 
protons; 

Using the x^ values measured for each residue^ 
the effective electron-proton distances, re^, between 
the TEMPO-GIC4 nitroxide aiul the irulividual 
amide protons in CBD^i arul CBD^q were calcu- 
lated. A similar analysis of the data for the 
TEMPOGlCa was rK>t performed due to tiie occur- 
rence of non-specific relaxation enhancement The 
most complete set of relaxation data was obtained 
from linewidth measurements, and thus tiie dis- 
tances reported in Figure 6 were determined using 
the Ali2xw values; however, similar results were 
derived from the AR^voi enhancements. Based 
on the data in Figure 6, we find that the measured 
values of r^^ range from 11 to 23 A. Consistent 
with the pre\ ious discussion of the data priesented 
in Figures 2 and 3, the amide protons in CBDm^ 
and CBDisQ closest to the lutroxide group are 
located'- in ^-strands Al and A4 on the binding 
faces of- the proteins, whereas those that are furth- 
est form strands Bl to B5. 



Structure calculations 

The positions of the nitroxide moiety in TEMPO- 
Gk:4 bound to CBDisn and CBD|^ were calculated 
using the measured as distance restraints for a 
simple mininuzation routine in X-FLOR (Brung^, 
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Figure 6. The correlation times, v and effective electron-proton distances, r^f, calculated for (a), (b) CBDnj and (cj, 
(d) CBPn2 bound to TEMPO-GIC4. Residues whose signals disappear completely due to severe lind>roadening ave 
shown with a shaded bar at an upper bounds of r^f -= 11 A. The locations of the ^-strands forming sheets A and B' in 
the two proteins are indicated by the open and filled arrows, respectively- 
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1992). To account for multiple binding orientations^ 
we used an ensemble-averaging protocol in which 
= [l/n Zr"*]"*^* where r is the electron-proton 
distance for each of n conformations, assumed to 
be equally populated (Bonvin & Briinger, 1996). 
A total of 60 minimization runs were performed 
using CBDni or CBDnj in a system with n = 1, 2, 
or 3 rutroxide groups, modeled as "atoms" of mass 
30 amu and a van der Waals radii of 15 A. When 
only one nitroxide group is utilized, the "atom" is 
always located in Ihe middle of the binding deft, 
approximately above ^trand A5. However, this 
somtion poorly satisfies the experimental restraints 
requiring the spin label to be closer to amide pro- 
tons on the edges of the binding deft than to those 
near its center. When an ensemble of two nitroxide 
atoms is considered, we find that in 31/60 and 60/ 
60 cases for CBDni and CBD^q, respectively, one 
atom is localized to each edge of the binding cleft 
(Figure 5). In the remaining 29/60 cases for CBD(^i, 
one atom is found near the middle of the binding 
deft and the other distant from the protein. The 
lowest energies dearly occur in the former situ- 
ation, with the nitroxide group near ^-strands Al 
and A4. The difference between the results 
detained for the two proteins is attributed to the 
exact choice of the bounds used for the distance 
restraints. When an ensemble of three nitroxide 
atoms is generated, 24/60 and 42/60 cases for 
CBD^n and CBDjsq/ respectively, have one atom at 
each edge of the binding deft and the third distant 
from the protein. In the remaining 36/60 and 18/ 
60 cases, the nitroxide groups occupy a variety of 
positions, dose to and distant from the proteins. 

Based on these calculations, we conclude that 
the simplest explanation for the observed patterns 
of paramagnetic relaxation enhancement is that 
TEMPO-ac4 biiKls both CBDni and CBDfc in two 
possible orientations such that ^e nitroxide moiety 
lies at either end of the binding deft (Figure 5). In 
support of these results, the separation of ^23 A 
between the positions calculated for the nitroxide 
atoms matches approximately the length of a 
TEMPO-ac4 molecule 

Discussion 

Binding affinity for spin- 
labeled ceilooiigosaccharides 

CBDni and CBDn2 bind TEMFO-GIC3 and 
TEMPO-GIC4 with affinities that are approximately 
equal to those measured previously for the uiuno* 
dified sugars. Although the six-membered ring of 
TEMPO resembles in some ways that of a gluco- 
pyrarK>syl unit, it difiers due to the presence of 
methyl substituents and the absence of equatorial 
hvdroxyl groups. As discussed by Tomme et al 
{1996a), calorimetric studies have revealed that the 
binding of soluble ceilooiigosaccharides to CBDm^ 
is driven by enthalpically favorable interactions 
such as hydrogen bonding and van der Waais con- 
tacts. We therefore condude that the TEMPO 



group does not interact significantly with CBDr^ 
or CBDm2 and is positioned in or near the bindiin^ 
cleft soldy by virtue of its covalent bondiiig to &ve 
ceilooiigosaccharides. Hiis statement is supported 
by the observation that the chemical shift chaitges 
accompanying the binding of the modified and 
urunodified cdlooligosaccharides to the two pro- 
teins are similar. We also condude that the prq^ 
sence of an anomeric hydroxy! group does n^jt 
contribute to the binding of ceilooiigosaccharides 
by CBDNi or CBDnz. This is not une)q>ected given 
that cellulose, the natural ligand for these proteins, 
exists ii^ a highly polymeric form. 

Multiple binding orientations 

Qualitative and quantitative analyses of the 
effects ot TEMPO-GIC3 and TEMPO-Qc^ on the 
rdaxation prcnserties of CBDni and CBD142 dearly 
demonstrate that the labded ceilooiigosaccharides 
are bound in at least two orientations. These orien- 
tations position the nitroxide moiety at ether ed^ 
of the binding cleft of each CBD, such that amide 

f protons in Strands Al and A4, as well as in the 
oop between A3 and A4, are perturt>ed by the 
spin label to a greater extent that those in the cen- 
tral strand A5 (Figure 5). SirKe the CBDs form 1;1 
complexes with the ceilooiigosaccharides, these 
orientations are mutually exclusive such that iri 
any individual CBD complex, the sugar is bound 
in one direction or the other. Given that the 
TEMP04abeled sugars and the corresponding 
unmodified ceilooiigosaccharides bind to the CenC 
CBDs with similar affinities and produce similar 
chemic^ shift perturbations, we condude Gl%r- 
GIC4, and by irtiference, amorphous ceUulose and 
other soluble p-glucans, are also bound in multiple 
orientations. 

Biruiing of the unmodified and TEMPO-labded 
ceilooiigosaccharides occurs on an NMR time-scale 
of fast exchange as evident by the progressive 
change in the chenucal shitts of CBDn^ and CBDn:^ 
upon the addition of these sugars. Therefore, the 
chemical shift of an amide proton at any point m 
the titration reflects a populatioivweighted average 
of its chemical shifts in the free and all-bdimd 
forms of the CBD. As a result, the equilibrium 
assodation constants determined by NMR and 
calorimetric methods represent apparent or macros 
scopic values that are the summation of the 
microscopic binding constants for each possible 
orientation of the sugar-protein complex (Wyman 
&GiU, 1990). 

An important question to ask is whether a pre- 
ference for one binding orientation exists. During 
the titrations of CBDni and CBD^q with the nitrox- 
ide-labeled sugars, it was noted that ttie signals 
from residues on ^-strand Al disappeared at lower 
levels of saturation than did those on A4 (Johnson. 
1998). This could indicate that the binding of 
TEMPO-GIC4 in an orientation with the nitroxid^ 
moiety *near ^-strand Al is slightly favored, or 
simply ihat the nitroxide is positioned doser to the 



amides in this strand relative to those in A4. If we 
assume that the TEMPOlabeled sugars bind in fast 
exdiange between two possible orientations, then 
the measured is given by: 

Here and are the distaiKes from an amide pro- 
ton to the two positions of the free electron, ^nd/^i 
and are the fractions of total protdn bound by 
the ligand in each orientation. Using the positions 
of the nitroxide atoms shown in Hgure 5, we calcu- 
late that varying /bi from 0.1 to 0.9 does not change 
the predicted for amide protons near the edges 
of the CBDmi and CBDn2 binding clefts by more 
±20%. In other words, within experimental error, 
the effects of any small population differences 
between the spin-labeled ligands bound in one or 
the other orientation cannot be distinguished due 
to the sbcth-powe* dependence of tt^ relaxation 
enhancement on the proton-electron separation. 
Hiis is readily seen by considering a case for 
which fi = 10 A and fj = 20 A. The calculated 
is 10.2, 11.2, and 14.4 A for /bi of 0.9, 05, and 
0.1, respectively. A similar condusion was made 
by Bonvin & Briingo- (1995, 1996) when they 
demonstrated that NOE-derived distances are not 
sufficient to determine the hactional occupaiKdes 
of the individual conformations of a protein that 
exhibits conformational heterogeneity. Because of 
this limitation, we conservatively estimate that the 
relative association constants for the binding of 
TEMPO-GIC4 to CBDni or CBDn2 in the two poss- 
ible orientations are within a factor of approxi- 
mately five- to tenfold. Note also that if = r2 and 
fbi= fh2f *en saturation of the CBD at 65% 
(/w = 0J25) and 80% (/bi = 0.4) leads to an 
that is overestimated by only ^^7% and 4%, 
respectivdy. 

Extrapolating from these studies, it is reasonable 
to suggest that TEMPGKIlCa and TEMPC)-Glc4 may 
be bound in mult^le conformations that have one 
or the other of these two general orientations, yet 
also differing in the exact position (or register) of 
the sugar with the p-sheet clefts of CBD^i and 
CBDn2. Evidence for this hypothesis stems from 
two observations. First, the longer values 
(>15 A) measured in this study, were consistently 
shorter than the electron-amide proton distances 
predicted from the calculated positions of the nitr- 
oxide atoms shown in Figure 5. Bearing in mind 
the significant enors associated with the measure- 
ment of these distances (see below), this could 
reflect a small population of ligand shifted to pos- 
ition the nitroxide near the center of the binding 
clefts of the CBDs and thus transiently close to 
amide protons in strand AS and ^heet B. Second, 
as noted previously Qohnson et al, 1996a), the ^H*^ 
and chemical shifts of CBDnj and CBD^j are 
perturbed similarly upon binding of GIC3, GIC4, 
GIC5, and GIc^. This suggests that the four cellooli- 
gosaccharides interact structurally with the CBDs 



in the same manner, possibly by sliding within 
their binding clefts* 

Structural symmetry can explain ttie multiple 
binding orientations ^ 

An explanation for the biiuling of cellooligosae^ 
charides by CBD^ji and CBD^q in multiple orierv- 
tations is provided by inspection of the structures 
of both the ligands and the protein molecules. 
Hydrogen bond formation between oligosac- 
charide and the protein provides an enthalpic driv- 
ing force for binding (Tomme et al., 1996a), and 
thus the locations of polar atoms within the mol^* 
ecules should be considered. As shown in Figure 7, 
the hydioxyl and hydroxymethyl groups of cellote- 
traose are approximately symmetrically disposed 
such that comparable hydrogen bonds could be 
formed^^to the CBDs whether tiie reducing end of 
the cdlooligosaccharide is located dois^ to 
P-strand Al or A4 In addition, as illustrated in 
Figure 8, the polar, non-polar, and aromatic side- 
chains implicated in cdlooligosaccharide recog- 
nition Johnson et al., 1996a,b) are approximat^ 
symmetrically positioned about the centers of th^ 
binding clefts of CBD^i and CBD^. Hiis provides 




Figure ^. Cellotetraose has approximate symmetry 
such that its hydroxyl and hydroxymethyl groups 
would occupy similar positions when the sugar chain is 
bound in eitfier orientation by CBDmi and CBDis^. 
(a), (b) Cellotetraose (Gepier et al., 1994) with its redu- 
cing end positioned at the bottom or top of the Figurei, 
respectively. The molecules are related by successive 
180*" rotations about axes perpendicular and parallel to 
the long axis of the cellooligosaccharide. 
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Rguie S. The residues implicated in cellulose recognition by the CenC CBDs are positioned approximately symme- 
tricaUy about the centers of th^ binding clefts. This may facilitate the binding of a cellooligosaccharide chain in 
eitiier orientation. Shown are ribbon diagrams of CBDim] (left) and CBDi^^ (^g^^) positioned for the reader to look 
across (top) and down onto (bottom) the cellooligosaccharide binding cieft that is formed by p-sheet A in each pro- 
tein. The side-chains of selected residues witfiin ^ cellulose binding clefts are color coded as blue, polar; red, hydro- 
phobic; and yellow, aromatic 



further opportunity for the complex formation of 
the cellooligosaccharides in multiple orientations, 
while maintaining similar patterns of hydrogen 
bonds to the equatorial hydroxy! groups and 
van der Waals contacts to the glucopyranose 
rings. 

It is unlikely that all the oxygen atoms in cellote- 
tiaose and the polar atoms in the binding clefts of 
CBDni or CBDn2 are involved simultaneously in 
intermolecular hydrogen bonding. The association 
of cellotetraose with CBDmi involves a AG" of 
-4.9 kcal mo|-^ at 35**C (Tomme et aL, 1996a). If 
the sole contribution to this binding free energy is 
assumed to be hydrogen bonding involving the 14 

Xhydroxyl groups, each would account for 
-035 kcal mol"- . Thermodynamic studies of 
several protein-carbohydrate complexes have 
demonstrated that the presence of some hydroxyl 
groups is essential for binding, whereas others are 
uitnecessary or even a hindrance. The net contri- 
butions of individual hydroxyl groups to binding 
generally range from —0.5 kcal mol"^ to —2 kcal 
mol"^ yet can be as high —6 kcal mol ^ (Lemieux, 
19%; Quiocho, 1993; Street et aL, 1986). Based on 



these values, CBDni likely forms between 23 and 
ten hydrogen bonds to ceUotetraose Given the 
large number of potential hydrogen-bonding sites 
on this sugar and on the side-chains lining the 
binding clefts of CBDi^ii and CBD^q, it is reason- 
able to^^uggest that cellooligosaccharides, as 'wJStt 
as amorphous cellulose, could bind these proteins 
in many possible conformations with similar free 
energies. These conformations include o'ppo^ite 
orientations of the sugar chain across the pi-sheet 
faces of the C6Ds> as well as possible multiple reg^ 
isters within their binding clefts. 

Further insights into the binding promiscuity' of 
the CenC CBDs are also gained from dynamic 
studies of the proteins and protein-sugar coin- 
plexes using ^^N and relaxation measure- 
ments. As will be reported elsewhere, the 
backbone amide protons and methyl-containing 
side-chains of CBD^j show motional disorder on 
a nana- to picosecond time-scale that is not 
restricted upon binding cellopentaose. This 
internal mobility or flexibility may also allow the 
cellooligosaccharides to bind in multiple, rapidly 
interconverting conformations. 



i 



Use of spin labels for obtaining llgand-protein 
distance restraints 

provide a powerf ul method for characterizing pro- 
tgir v<ar50hydl'at^ inttfa(.'tiuns'. UuaiitatiVely,"'thfe 
effects oi the frte raaical on uie NMR spectra of 
CBDni and CBDn2 yielded immediate evidence for 
the binding of TEMPOGlcg and GIC4 in multiple 
orientations. Although it was also possible to cal- 
culate the effective positions of the nitroxide moi* 
ety bound in a minimum of two possible 
orientations on the CBDs from a quantitative anal- 
ysis of die measured paramagnetic relaxation 
enhancement, the accuracy of these models is lim- 
ited for numerous reasons. These include: (i) com- 
plications associated with multiple conformations 
and partial saturation, as discussed above; (ii) 
the uncharacterized conformational mobility of die 
bound TEMPO group; (iii) inaccuracies in the 
NMR-derived structures of CBDni and CBD^z cal- 
culated without inclusion of the bound GIC4 or 
GIC5, respectively Gol™on et al,, 1996b); (iv) errors, 
which tend to lead to an underestimation of longer 
values of arising from difficulties in measuring 
small values of SRi and AR2, and from the possi- 
bility of relaxation enhancement due to non- 
specific binding or to collisions between CBD- 
TEMPO-GIC4 complexes in solution; and (v^ 
i^rWnf Higtencg restraints with < U ' 
fro m the efficiency of paramagnetic relaxation. The 

latter problem could be addre 

CBpKn and LB Prsg partially fatrated wi th_smaU 
quantitl g^f i'iiMFCKjlC4 Or with mbctui 
labeleaTu gar in its bkidized and reduced forms in 
orden o^identiiv the amides closest to the bound 
nitn^cidL 

;o'lmderstand in detail the mechanism of cellu- 
lose binding by the two Family IV CBDs, we are 
currently eOTloiting glycosynthases to prepare 
selectively ^K!-labeled cellooligosaccharides. Inese 
compounds will t>e utilized for isotope-edited and 
filtered experiments aimed at resolving and assign- 
ing intermolecular NOEs between the CBDs and 
the bound sugars. Using a combination of short- 
range distance restraints derived from interproton 
NOE interactions and long-range restraints from 
electron-proton paramagnetic relaxation enhance- 
ments, we hope to defizie more accurate structural 
models of the ensembles of the complexes of 
CBDni and CBDn2 with cellooligosaccharides. 

Biological implications 

Studies of numerous cellulases and hemicellu- 
lases indicate that CBDs facilitate hydrolysis by 
passively increasing local enzyme concentrations 
on the surface of ceUulosic and hemicellulusic sub- 
strates and, possibly, by actively disrupting non- 
covalent interactions between the chains of these 
polysaccharides (Tomme et al., 1995). The speci- 
ficity of CBDs for the various allomorphs of cellu- 
lose (e.g. crystalline or amorphous) wiU also serve 



to target these enzymes to distinct regions of this 
complex substrate. 

The results presented here demonstrate deaify 
that in isolation, CBD^x aiul CBD^i each bind sol- 
uble cellooligosaccharides in multiple orientations. 
This iirmiediately prompts the question as to ttte 
role played by these CBDs within their native coo- 
text. C&nC is a 1069 residue P-l,4-glucaiuise, coi% 
posed of CBD|sji and CBD^q arranged in taiuiem Jb 
its N terminus, a central family 9 catalytic domaij|^ 
and two additional domains of unknown functida 
at its C terminus (Coutinho et al., 1992, 1991). Th^ 
enzyme cleaves a variety of ceUulosic substrates 
and, based on an analysis of its hydrolysis of car- 
boxymethyl-cellulose, is semi-processive with both 
endo- and exoglucanase activities(Tomme et al., 
19%b). Althou^ the structures of the isolated 
family IV' binding domains (CBD^^ and CBDt^) 
and th6 homologous Family 9 ca^ytic domains 
from Clostridium thermocellum celD Quy et dl^ 1992) 
and Tfiermomonospora fusca E4 (Sakon et al,, 1997) 
have been determined, the spatial arrangemoit of 
these modules within native C. find CenC remains 
to t>e defined. 

CBDf^ and CBD^j are juxtaposed without an 
inter\'cning linker. Given that the affinity of the 
each domain for cellotetraose is equivalent whether 
joined or in isolation (unpublished), it is likely that 
CBDni ^Q can also bind oligomeric sugars in mul- 
tiple orientations. This theory is reasonable, as the 
binding- clefts of the two homologous CBDs both 
span approximately five glycosyl units and thus 
each domain in CBDninq can iiKlependently 
accomiRodate a single cdlooligosacchaiide mol- 
ecule. ISeliminary studies indicate that the affinity 
of CBE2f;iiN2 for phosphoric add swollen cellulose 
is only approximately 2 fold greater than that of 
CBDmi for this p>olymeric substrate (Toixuiie et aL, 
19%a). This implies that, when joined in tandem, 
these CBDs bind amorphous cellulose in an addi- 
tive, rather than cooperative, maimer. A simple 
interpretation of this result is that the two domains 
comprising CBD^inj are structurally constrained, 
due to the lack of a flexible linker, such that they 
cannot bind simultaneously to adjacent regions*(»f- 
a single polymer chain. We therefore speculate that 
the tandem CBDs anchor CenC to its natural sub- 
strate by bridging chains of amorphous cellulose 
without: a strong prefereiure for their orientatioivs. 
Note however, as discussed above, we can only 
estimate that the relative association constants for 
the binding of TEMPO-GIC4 to CBDi^i or CBDnj in 
the two possible orientations are within a factor of 
approximately five- to tenfold. Thus in combi- 
nation. CBDnin2 ^y in fact exhibit a significartt 
overall preference for binding strands of amor- 
phous cellulose positioned to match the yet 
unknown orientation of the two domains within 
this tandem CBD. Resolution of this important 
queshcm hinges upon the determination of the 
structure of CB[>nin2, ^s well as further character- 
ization of naturally occurring amorphous cellulos^e 
(Bayer et aL, 1998). 



CBDniisq is connected to the catalytic domain of 
CenC by a short proiine-rich polypeptide linker 
sequence (Coutinho et al., 1991, 1992), In contrast 
to the isolated CBDs, it is catain that this catalytic 
domain has a definite orientational requirement for 
the binding and hydrolysis of ceUulosic substrates. 
Although the exact boundaries and thus length of 
the linker sequence is unknown, the observation 
that it is highly sensitive to proteolytic cleavage 
(Tomme et al., 1996b) suggests that the linker 
affords at least some degree of flexibility between 
the binding and catalytic domains of CenC. This 
may allow the cleavage of properly oriented chains 
of amorphous cellulose other than those to which 
the CBDs are bound. Processivity would result 
from the diffusional sliding of both the binding 
and catalytic domains along the substrate cellulose 
chains. Enzymatic studies of CenC with and with- 
out CBPnin2/ ^ ^ structural characterization 
of the catalytic domain, are clearly necessary to 
understand the role of these Family IV CBDs in 
modulating the activity of this Cfimi p-l,4-gluca- 

nase. 

Although over 180 putative CBDs in 13 faxnilies 
have been identified, little is known about the 
detailed mechanism of cellulose and hemicellulose 
binding by these protein domains. Studies of sev- 
eral family 2 CBDs, such as those from C fimi Cex, 
have demonstrated that binding to crystalline cel- 
lulose is mediated by conserved aromatic residues 
aligned along a flat face of the molecule (Ku et al., 

1995) . Calorimetric measurements have proven 
that binding is entropically driven(Creagh et ai, 

1996) , indicative of hydrophobic stacking of the 
aromatic side-chains on the glucose rings. Hydro- 
dynamic studies have also revealed that CBDcex 
diffuses readily across the surface of crystalline cel- 
lulose Qecvis et al,, 1997). We therefore speculate 
that CBDcew ^ ^ members of other CBD 
families, also binds in multiple orientations to cel- 
lulose. In contrast, based on the crystal structure of 
a fragment of the endo/exocellulase E4 containing 
a family 3c CBD and a family 9 catalytic domain, 
Sakon et al. (l!?97) have hypothesized that the bind- 
ing domain aids in the activity of this enzyme by 
helping to feed a cellulose chain, bound in a 
defined orientatioa into the active site of the cata- 
lytic domain. TWs distinctive behaviour may result 
from the intimate structural association of the two 
domains in the T. fusca endo/exoceUulase. 

In sununary, we have shown that CBDni and 
CBDn2 from C fimi CenC bind TEMPO-labeled cd- 
looligosaccharides in multiple orientations. This 
promiscuous mode of sugar recognition can be 
attributed to the approximate synunetry of the 
hydrogen bonding groups on cellulose and on the 
side<hains lining the binding clefts of these two 
CBDs. Further structural studies of both native 
CenC and its substrate, amorphous cellulose, are 
necessary to understand the mechanisms by which 
this and other cellulolytic enzymes function in the 
dfident degradation of bionmss. 



Materials and Methods 

Synthesis of TEMPO-labeled cetlooligosaccharldea^:;. 

Details of the synthesis of TEMPO-Gka and TENffQ- 
Glq wiU be provided elsewhere. lonspray mass spec- 
troscopy of the Onal products yielded 658 (M+), 699 
(M + 1) and 660 (M + 2) for TEMPO-Gk3 (calculated for 
C27H4»NOi7: 658.7) and 820 (M+), 821 (M + 1), and 822 
(M + 2) for TEMFQClCi (calculated for C^sHshNO^: 
820^). Useful ^H-NMR data could not be obtained for 
these products due to tfie presence of spin label 
Structural characterization was therefore achieved by 
reduction of the nitroxide group to a hydroxylamine ai^ 
acetylation of the entire molecule. The spectral data for 
ttiese derivatives of TEMPCK^lcj and T£MPO-Gk4. 
respectivdy, are provided below. ' 

2^,6,6'Tetramethytpiperidirh- Uacetoxyl^yl 2,3,4,6'tetra' 
0-acetyt^'0^lucopyranosyH^~*^^hZ3,6'trhO'acetyh 
^'D-glucopyranosyh(1-^ 4)-2,3,6'triO'aGetyh^' 
D-giucopyranoside 

NMR (500 MHz, CDCI3) 8: 5.14 (dd, 1 H, ]U 
9.6 Hz, H-3), 5.13-5.09 (m, 2 H, H-3' H.3"), 5.05 (dd, 1 H, 
93 Hz, H-i"), 4.89 (dd, 1 H, ]yx Hz, fc-.r 
8.1 Hz, H-r), 4.85 (dd, 1 H, /j.^ 95 Hz, h r> 8.7 Hz, H- 
1\ 4.83 (dd, 1 H /aj 9.6, Hz 73 Hz, H.2), 4.60-4.47 
(m, 4 H, H-1 H-r H-r H6a), 4.42 (dd, 1 H. /^^ 
11.7 Hz, l^ s 1-5 Hz, H-6a'), 434 (dd, 1 H, /^-^^ 
12.4 Hz,:;^ 3- 4.4 Hz, H.6a"), 4.14 (dd, 1 H, 43 Hss, 
H-6b'), 4i)7 (dd, 1 H, 5.9 Hz, H-€b), 4.04 (dd, 1 H 
1.6' Hz, H-6b"), 3.93 (m, 1 H, H-l*"), 3.76 (dd, 1 BE, 
U- 5 9.4m H4'), 3.71 (dd, 1 H, /4 5 93 Hz, H-4), 3.& 
(m, 1 H, :H.5"), 3.63-338 (m, 2 H, H-5 H-5'), 2.14-1.98 (33 
H, 11 X Ac), 133^138 (m, 4 H, H-2'" H-3'"), 1.13 (6 % 
CH3), 1.10 (6 H, CH3). MS (lonspray) 1123 (M + 1); cal- 
culated for QvHtiNQw (1122.1). 



2,2,6,6'Teiramett}ylpipendin'1'acetoxyl'4'yl 2,3,4,6'tetrQ' 
O-acetyy^-ogtucopyranosyH'f-^^h 2,3,6'iriO'acatyl' 
p^o^luc^yranosyHI^ 4)- 2,3,6'triO'acetyl^^ 
[hgiucopyranosyl -<t-^ 4)- 2,3,6'thO'acetyh^' 
(>glucopyranoside 

^H NMR (400 MHz, CDCI3) 6: 5.15 (dd, 1 H, Ji, 
93 Hz, H-3), 5.12-5.08 (m, 3 H, H-3' H-3" H-3'"), 5M 
(dd, 1 H//4-jr 93 Hz, H-4'"), 4.89 (dd, 1 H, J^-j- SS^Hz^ 
Jr- r 8.0 -Hz, H.2'"), 436- 431 (m, 3 H, H-2 H-2' ^2*^, 
4^04.47 (m, 5 H, H-1 H-1' H-l" H-l'" H6a>, 4.60-432 
(m, 3 H, H-<;a H-6a" H-6a'"), 4.15-4.00 (m, 4 H, H-6b H- 
6b' H-eb" H-6b ") 3.95 (m, 1 H, H-l""), 3.81-3.68 '(m, 3 H, 
H-4 H-4" H-4""), 3.66 (m, 1 H, H.5"'), 3.62-337 (m, 3 A 
H-5 H-5' H-5"), 2.15-1.96 (42 H, 14 x Ac), 135-1.37 (m, 4 
H, H-2"'' H-3""), 1.15-1.10 (12 H, 4 x CH3). lonspray MS 
1411 (M + 1); calculated for QiH„7N03ft (141034). f 



Protein samples :-t 

The ^'N-labeled CBDni (residues 1-152 of CenC) arid 
CBDn2 (residues 146-296) were produced by expressic^ 
of the plasmids pTugNl and pTugN2 in Escherichia cdi 
]M101 cells grown in minimum media containing 1 g/1 
of ^NH4a and 1 g/1 of 99% "N-labeled Isogro (Isotec 
Inc.; Johi\son et al., 1996a). The secreted proteins were 
purified from the culture supernatant and die periplae- 
mic fractiiin using both affiruty chromatograpny on cel- 
lulose (Avicel) and ion-exchange chromatography, as 



{ 



described Qohnson et d., 1996a). Samples of CBE>n, and 
CBDm2 were exdianged into a final buffer of 50 mM 
sodium chloride, 50 mM sodium ^Hs-acetate, 3 to 5 mM 
CaQj. and 0.02% sodium azide in 10% ^H2O/90% HjO 
at pH* 6.1 using a microsqj concentration device (Fil- 
tion). CBDni binds caldiun with an equilibrium associ- 
ation constant of --10* M~" at pH 6.0 and dius exists as a 
1:1 CBDigj-Ca*^ complex under these conditicwis 
(Johnson et al„ 1998). The metal ion stabilizes the folded 
structure of CBDni but does not alter its affinity for eel- 
looligosaccharides. In contrast, CBDn2 does not bind cal- 
cium appreciably. 

X 

\ 

Titrations with TEMPO-labeled cellooligosaccharides 

The binding of Ae TEMPO-labeled ceUooligosacdiar- 
ides to the CenC CBDs at pH* 6.1 and 35*'C was 
measured quantitatively using ^H-^ H5QC spec- 
troscopy. Stock solutions of TEMPO-Glcj and TEMPO- 
Glc4 were prepared by weig^it in the above buffer and 
added in small aliquots to the protein samples. The 
titration of CBDni with TEMFO-Qcj was carried out by 
progressively adding the labeled sugar to 032 mM pro- 
tein up to a final ligand-to-protein ratio of 22:1. The 
Htrations of 0.65 mM CBDni and 0.48 mM CBDn2 with 
TEMPO<ik4 were taken to final corrected ligand-to-pro- 
tein ratios of 22:1 and 1.4:1, respectively. Protein concen- 
trations were measured using eisti = 21370 M"- cm"^ 
and 20500 M"^ cm'* for CBDni and CBDn2. respectively 
Oohnson et al,, 1996a). Equilibrium association binding 
constants were determined by non-linear least-squares 
fitting of tfie ^H'^ and "N domical shifts of numerous 
amide protons versus ceUooligosaccharide concentration 
to tfie Langmuir isotfierm describing the binding of one 
ligand molecule to a single protein site(Johnson et al,, 
1996a). These amide protons, which include Gly7, Glyl5, 
Val34 Gly44, Thi^, GlnSO, AsnSl, Thr«7 and Glyl30 for 
CBDni and Vall97, Tyrl98, Gly231, Tyr234 and Ala237 
for CBDn2' exhibit fast exchange between the free and 
bourul states on tfie diemical shift time-scale and are 
detectable, albeit with diminishing intansity, over the 
couxse of the entire titration. In the cases of the TEMPO- 
GIC4 titrations, good fits of the experimental data points 
to tfie binding isothCTms were obtained only if scaling 
factors of -13 (for CBDn2) or 1.8 (for CBDni) were intro- 
duced to correct for apparent errors in measuring the 
weight of the sugar (-1 mg) during the pr^aration of 
stock titrant solutions. The reported binding constants 
and errors are tfie avoages and standard deviations of 
the individual values measured for each *H*^ and ^^N 
nudeL 



NMR spectroscopy 

Spectra were acquired on a Varian Unity 500 MHz 
spectrometer at 35 ^'C. Each point of the titration of the 
"N-labeled proteins with TEMPO-GIC3 or GIC4 was mon- 
itored by the acquisition of a sensitivity-enhanced gradi- 
mt *H-"N HSQC spectrum (Kay et aL, 1992). Upon 
formation of tfie sugar-protdn complexes, ^H*^ values 
were measured using a sensitivity-enhanced gradient 
^H-'^N HSQC sequence as a read-out of a non-selective 
invereion-recovery experiment recorded with delays of 
t = 0, 0.1, 0.2, 0.4. 0.8 and 2 seconds. The / = 0.1 spec- 
trum was repeated to help estimate experimental error. 
A non-soisitivity-enhanced gradient 'H-^^ HSQC spec- 
trum was then recorded to obtain the spin-spin relax- 
ation enhancemOTts, ^2, of tiie amide 'H^ nuclei After 



completion of die data collection with atiier CBDni or 
CBI)n2 bound to die paramagnetic ceUooligosaccharide, 
die nitroxide functionality was reduced to die diamagr 
netic hydroxylamine by the addition of two molar 
equivalents of solid L-ascorbic acid (Sigma). The pH* irf* 
the sample was re-adjusted to 6.1 and the spectra necesst 
ary to measure AK} and AR2 were recorded. All spectra 
were acquired as 1024 x 96 complex points with spectral 
widths of 6500 and 1450 Hz in die ^H and dimen- 
sions, respectively. Selective water flip back pulses were 
utilized to ensure minimum pertuibation of die water 
magnetization (Zhang et al., 1994; Grzesidc & Bax, 1993). 



Calculation of A/?t and AR^ 

NMR spectra were analyzed using a combination of 
Felbc (Biosym Technologies) and NMRPipe (Delaglio 
et fl/., 1995). Data for the ^H T| series wm processed 
widi mild Lorentzian-to-Gaussian apodization. The rda- 
tive peak volumes, V,, at each inversion-recovery delay , f 
were fit 4o the function V, = V„ exp(-f/r i), as described 
by Farrow et al. (1994). V„ is the volume at time t - 0, 
and errors in the measured Ty lifetimes were estimated 
using Monte Carlo simulations. The paramagnetic 
enhancement of each amide proton spin-lattice relaxation 
rate, was calculated as: 



) 
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Ox and red indicate the data collected with the labded 
cellooligosaccharides in die nitroxide and hydroxylamii|e 
forms, respectively. 

The non-soisitivity enhanced *H-"N HSQC spjectea 
were processed without apodization in the protc^ 
dimension and with a mild exponential linebroadenii^ 
window- in die nitrogen dimension. Two methods weup 
used to misasure individual values of AR2* In the first, 
the ratio ot the peak volumes recorded for an amide in 
die presence of the nitroxide (V^) versus the hydroxy la- 
mine-labded {V^) cellooligosacdiaride is given by: 



e-7T2^ 



(2) 



where t= 10.1 ms is the total time during the INEPT 
and reverse-INEFT components of die HSQC pulse 
sequence in which ^H^ magnetization is in die transverse 
plane a&d thus subject to paramagnetic relaxatiopu 
A non-gradient enhaiKed HSQC was utilized to avoid . 
possible complications due to the use of two reverse- 
INEFT sequences. By measuring total peak volumes, and . 
not heights, the effect of diferential T2 relaxation during 
the detection periods of die spectra recorded in the pre- 
sence of the oxidized and rexluced sugars can be neg- 
lected. Peak volumes were determined by direct 
integration over a manually adjusted "footprint" and 
not by fitting of an idealized peak shape. Equation 
can be rearranged to obtain AR2.vcii* 

-=-=(a-feL=e)'"{^) ^ 

In the second method, AK2.LW were obtained from the 
change in the linewiddis of the ^H-"N cross-peaks in the 
proton dimension: 

^Ri.Lw=(~^ -(jl) j= nLW^ - LW,^) (4) 



* « 

where LWcw and LW^ are the full half-hd^t line- 
widths in the presence of the nitroxide and hydroxyla* 
mine-labeled cellooligosaccharides, respectively. The 
line-widths were measured by non-linear least-squares 
fitting of vectois, extracted in the proton dimension 
through a given cross-peak, to the equation for a single 
Lorentzian peak split by a ^/hn-h« covipling using ttie 
program PLOTDATA (TRIUMPH, UBC, Vancouver, 
Canada). The coupling constant was eitfter treated as an 
independent variable or fixed at the value measured pre- 
viously Qohnson et al,, 19%b) using ttie HNHA or 
HMQC-J experiments (Bax et al„ 1994; Kay & Bax, 1990). 
Significant mors in the measurement of AK2.LW arise if 
this coupling is ignored. The effects of die fiee electron 
on the relaxation rates of the amide nitrogens are neg- 
lected due to the small magnetogyric ratio of the 
nudeus relative to that of ^H. 



Calculation of 

The magnetic interaction of an ui^aired electrcHi 
and a proton is described by the modified Soiomon- 
Bloembergen equations (Sblonon Bloembergen, 
1956).The enhancement of tiie longitudinal (aRi) and 
transverse (AK2) relaxation rates of a proton due to tiie 
spin label are given by tfie equations (Koserv 1989; 
Gellespie & Stortle, 1997a): 

where K is tiie constant 123 x 10~^ cm'* s~^ for a nitrox- 
ide radical, r is ihe distance between ti\e electron and 
proton^ T,. is the correlation time for the fluctuation of the 
electronic-nuclear dipole-dipole interaction, aiul uih Is 
the Larmor frequency of the proton. These equations are 
based on the assumptions that the vector between the 
electron and proton is free to undergo isotropic 
rotational diffusion^ arui that tiie distance r is constant. 
The value of Xc individual amide can be deter> 

mined directly from the ratio of £ji2 to ^^1 by combin- 
ing equaticms (1) and (2) to obtain: 




(7) 



With a value of in hand, tiie distance between the pro- 
ton and electcOT, r, is calculated by rearranging equation 

(2): 

Individual values of rattier than a global average, 
were utilized to reflect directiy the experimentally 
measured relaxation enhancements for each amide. For 
an ensemble of interconverting protein-ligand com- 
plexes, the measured distances will reflect the weighted- 
average of the individual electron-proton distances and 
thus are denoted as fcff. 

Standard deviations for AR,, AI^2. fcff were cal- 

culated by conventional error propagation (Bevington & 
Robinson, 1992). 



Structu^ modeling 

The positions of ttie nitroxide moiety in TEMPO-GIC4 
boimd to CBPni or CBDt42 were estimated using .>a. 
simple restrained minimization protocol provided wi^ 
X-PLOR.-V.3.8 (Briinger, 1992). The orOy attractive force 
was the -soft-square potential of the distance restra^' 
energy term (£noc) ^ repulsive force was iMit 
corresponding to van der Waals interactions (£vdw)> 
bounds on tiiie effective dectnm-proton distances, r^, 
were defined for ttiree cases. First, for eight amide pro- 
tons in CBPtsii and ei^t in CBDn2 for which AR2 valued 
could not be obtained due to severe paramagnetic line- 
broadening (thus fcff < 11 A) , the distarKe bounds were 
set to the range of 1-11 A. Second, for 27 amide protons 
in CBDf^i and 26 in CBD^Q for whidi the measured 
were 11 to 15 A, the distances were restrained to 
r^i ±3 L Third, for 54 amides in CBDni and 59 in 
(^Dn2 witti > 15 A, the distaiKe bourtds were set 
from 13 A to an arbitrarily high value of 45 A. Although 
distances between 11 and 23 A were measured for 
CBDkii and CBDn2 (Figure 6), it was found ttiat ttiose 
>15 A were consistentiy underestimated in light of the 
dim&isions of these proteins. Therefore, r^n values g^ea^ 
ei than 15 A were used only to derive lower bounds to 
restrain the nitroxide atoms from approadiing the corre- 
sponding amide protons in the two protein structures. 

To account for multiple positions of TENfPO-GlCt 
bound to CBD^i or CBDt^ ensemble^veraging (Bonvin 
& Briinger, 1996) was applied to coordiruites systems 
containing one, two, or three nitroxides, defined as single 
atoms witii a mass of 30 amu and a van der Waal radius 
of 25 ArThe co-ordinates of the protdn (pdb file lULO 
for CBDni and a low-energ}' structure for CBD^ 
(unpublished)) were held fixed, and eadi ensemble bf 
nitroxide groups was minimized 60 times starting fro^ 
random positions. The quality of ttie calculated struicS 
tures were estimated by tiie NOE energy pnalty due to 
distance restraint violations* 
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The geometric relationships between ligands and tfie functional gioup>s 
that bind ligands in soluble ligand-protein complexes have traditionally 
been deduced from distance constraints between pairs of NMR active 
nuclei spanning the ligand-protein interface. Frequentiy/ the steep inverse 
distance dependence of the nuclear Overhaxiser efiect (NOE), from which 
the distance constraints are derived, makes identification of sufficient 
numbers of constraints difficult. In these cases the ability to supplement 
NOE-derived information with distance-independent angular information 
can be very important. Here, the observation of residual dipolar coup- 
lings from a-methyl mannose boimd to mannose binding-protein in a 
dilute liquid crystalline medium has allowed the determination of a 
bound ligand's average orientation. The 3-fold rotational symmetry of 
mannose-binding protein defines its orientational tensor and obviates the 
need to determine experimentally the protein's average orientation. 
Through superimposition of ligand and protein orientational tensors we 
describe the binding geometry of a-methyl mannose bound to mannose- 
binding protein. This new method is of general applicability to the study 
of ligands bound to proteins, and it is of particular interest when neither 
X-ray crystallography nor NOE techniques can provide sufficient infor- 
mation to describe binding geometries. 

® 1999 Academic Press 

Keywords: dipolar couplings; NMR; structure determination; carbohydrate 
interactions; mannose-binding protein 



Introduction 

We illustrate here a new approach for the deter- 
mination of ligand geometiy in protein binding 
sites that relies on orientational constraints derived 
from residual dipolar couplings observed in NMR 
spectra of partially ordered macromolecular com- 
plexes (Bothner-By, 1995; Prestegard, 1998; 
Prestegard et al,, 1998; Tjandra & Bax, 1997; 
Tolman et aL, 1995). The determination of the 
bound geometry of protein Hgcinds is an important 
issue for improving our vtnderstanding of how 
structure relates to biological function, and also for 
building a basis for rational drug design. Many 
useful approaches to this determination have been 
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developed in the past, including X-ray diffraction 
of crystalline protein-ligand complexes and NMR- 
based transfer NOE studies of complexes in sol- 
ution (Ni, 1994). All the approaches are limited in 
certain ways. In the case of X-ray crystallography, 
complexes must be crystallizable; in the case of 
transferred nudear Overiiauser effect (NOE) stu- 
dies, ligands must be in rapid exchange. Transfer 
NOE studies, while giving sound information on 
ligand geometry, also svtffer from a lack of infor- 
mation on the nature of ligand protein contacts. 
Information on these contacts can be provided by 
more conventional NOE studies using both protein 
and ligand resonances in some cases, but tius infor- 
mation is often difficult to obtain because it 
requires assig;i\ment of protein as well as ligand 
resonances, and because it presimies sufficiently 
dose approach of protein and ligand protons to 
allow effident magnetization transfer (r < 5 A). 

Cases where determiiuition of bound ligand geo- 
metry have been particularly difficult indude those 
involving protein-oligosaccharide interactions. 
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These cases encompass an important area in that 
protein-oligosaccharide interactions mediate a 
variety of processes essential to cell function and 
survival. To name a few, sialyl Uwis-X oligosac- 
charide-selectin interactions mediate the recruit- 
ment of leucocytes to sites of mjury. The 
interaction of choleratoxin with the oligosaccharide 
headgn>up of gangUoside CM, mediates mtemaliz- 
•tion of U» toxic subunit, and polylactosanune- 
Kftledin lntef»ctions are hypothesized to be 
givolv^ in the metastasis of mahgnant cells 
Qiaracterization by traditional NMR 
Skosaccharide-protein interactions 
-.-^i^- w-ewprocesses is diffioilt becaiise 
^^Mmhixydkag networks involving hydroxy! 
^^^^fi Oiemigais are often part of the interface 
'^^m piotetn and oligosaccharide. The hydroxyl 
mMi^m ^ the ones in closest proximity to 
irotons, but these exchange rapidly with 
protons In bulk water making their NMR reson- 
Mos^hard to observe. In these difficult cases, 
^i^diial dipolar data can offer a valuable altema- 
^"^";?W?io« the data can in principle constrain both 
'"^■^ligand geometry and ligand orientation 

t to the protein binding site. 

..Spiif'^particuto target in the studies presented 
'^iere is mannose4)in<£ng protein- A (MBP). MBP is 
a protein important in irmate immime response. It 
binds sugars, irKhiding a-mannosides, that are fre- 
quently exposed on the surface of invading patho- 
gens, aiKl subsequently triggers a complement 
cascade (Dceda et al., 1987). Improper activation via 
MBP has also been suggested as a factor in inflam- 
matory disease (Dwek, 1996). Crystal structures of 
several forms of MBP are available. The form we 
study here is a 149 residue truncated version that 
iiKludes a calcium-containing carbohydrate recQg- 
lution domain (CRD) and a short a-helical stem 
that initiates the formation of a homo-trimer by 
making a left-handed coiled-coil with helical stems 
from two other MBP monomers. The 3-fold sym- 
metry axis in the resulting homo-trimeric structure 
is important for the interpretation of our results. 
The binding of several ligands to MBP and mutant 
MBPs that mimic selectin proteins has also been 
studied, both because of interest in MBFs ligand 
binding properties and those of proteins which 
show homology to the MBP CRD, such as the 
selectins. Crystallography has been the primary 
contributor in studies of MBP trimers, wilii struc- 
tures existing for boimd galactoside and galactosa- 
mine, as well as sialyl Lewis-X and related ligands 
(Chang et al, 1994; lobst et al, 1994; Kolatkar AR, 
1998; Ng & Weis, 1997; Weis & Drickamer, 1994). 
Additionally, there are structures of a shorter frag- 
ment of MBP which makes a homo-dimer; one of 
these includes a-methyl mannoside in the carbo- 
hydrate binding site (Ng et ai, 1996). The results of 
the above crystallography studies provide a useful 
basis for evaluating data from our new approach. 

The new approach that we present is based on 
the observation of residual dipolar coupling of 
directly bonded pairs in a-methyl marmo- 



side (AMM) when the maiuioside is bound to MBP 
in a field-oriented aqueous liquid crystal. The use 
of residual dipolar coupling in the characterization 
of the structures of proteins ai>d other macromol- 
ecules in solution has received a good deal of 
attenHon recently (Clore aL, 1998a; T)andra k 
Bax, 1997) . When molecules are partially ordea\J 
in a magnetic field interactiofv vectors connecttng 
magnetic nuclei in the molecules depvt from a 
complete isotropic salnplif^^itt^dl bi 
S^ves rise to • conybiHiSifer»l^ 
of NMR resonances fbai depmdi^S^''^'^^^ 
properties of the into^^ig^^SI 
separation (r) and the ang^j^l^ 
makes with the magnetic ifit^^'^W^ mkld 
are directly bonded as In a 'H-^ ot^H-nc: 
the l/r^ distanceniependence of tfie interaction can 
be regarded as being determined by the bond 
length and die primary variable becomes the orien- 
tation of the l>ond relative to the magnetic field as 
characterized by the angle 9. The functiorul depen- 
dence of the dipolar couplings is <l/2(3cos* 
0 - 1)), where the parentheses denote an average 
over an orientation distribution, and the resulting 
interaction appears as an addition to the normal 
one bond scalar couplings. These through space 
interactions are always present; in fact, they are the 
basis of the common NOE interaction. However, 
the angular function above is averaged to zero 
when space is isotropicaUy sampled and direct 
splittings of resonances orily appear in partially 
oriented media. Fortunately, several means of pro- 
ducing field-ordered states are now available, 
including inherent orientation due to the large ani- 
sotropics in the magnetic susceptibilities of some 
molecules (Kimg et d,, 1995; Tolman et al, 1995), 
orientation due to the interaction of molecules with 
lipid bicelles that form field-oriented liquid crystals 
(Tjandra & Bax, 1997) and interaction with filamen- 
tous bacteriophage that also form field-oriented 
liquid crystals (Clore et al,, 1998c; Hansen et al, 
1998). 

The interpretation of residual dipolar data has 
taken two routes, incorporation of individual bond 
constraints as penalty functions in simulated 
aimealing protocols for molecular structure deter- 
mination (Clore et al., 1998a,b), and extraction of 
order tensors that directly describe the direction 
and level of ordering forces from the point of view 
of a coordinate frame fixed in a rigid molecular 
fragment (Losonczi et al, 1999; Losonczi & 
Prestegard, 1998b; Saupe, 1968). The latter 
approach which arose from substantial previous 
work in the liquid crystal field, proves to be par- 
ticularly valuable for the problem at hand. An 
order tensor is a 3 x 3 matrix with elements 
(l/2(3cos6yCOsey - 6,y)) written in terms of direc- 
tion cosines that relate Cartesian axes of an arbitra- 
rily chosen molecular axis system to the magnetic 
field direction. Because the matrix is traceless and 
symmetric, there are only five independent 
elements. An order matrix in an arbitrarily chosen 
frame of a molecular fragment is difficult to inter- 
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pret, but if diagonalized, the five independent vari- 
ables turn into a principal order parameter, an 
asymmetry parameter and three Euler angles relat- 
ing the diagonal, or principal frame, to the original 
molecular frame. If fragments of interest experience 
a common source of order, as they would if they 
were parts of a rigid ligand-protein complex, the 
directions and levels of the orienting force should 
appear the same from the point of view of each 
fragment when a model for the fragments is 
assembled with proper fragment orientations. This 
provides a test for the validity of proposed models 
for die orientation of a ligand (one fragment) in a 
protein binding site (another fragment). 

The key to using dus test is having an approxi- 
mately rigid fragment with enough potential dipo- 
lar coupled pairs to determine the five elements of 
an order tensor. For a "N-labeled protein, pairs of 
^H-^^ spins are deariy abundant and mudi.of die 
protein backbone can be considered rigid. In oligo- 
saccharides, individual pyranose rings frequently 
prefer a configuration whkh can be considered 
to have approximately rigid geometry and 
pain are abundant hi OL-memyi mannoside there 
are five direcdy bcmded pain on the 

rin£, two of whidi point in two unique directions 
ana three of whidi depart slightiy from a third 
direction. These pairs can provide the data for an 
order tensor determination. There are a number of 
ways of accurately determining the splittings of 
resonances from these pain (Tohnan & Prestegard, 
1996) but the easiest is simply to coUect ^H-^^ het- 
eronudear single quantum coherence (HSQC) spec- 



tra, without decoupling in one of the frequency 
domains. This is the approach we use here. We 
used uniformly ^^-labeled a-methyl mannoside in 
order to improve sensitivity. 

Results and Discussion 

Measurement of residual dipolar couplings In 
AMM complexed to MBP 

The *H-^^ HSCJC NMR spectra of ^^^enriched 
AMM in the presence of an equimolar amount of 
MBP dissolved in a dihite bicelle medium are 
shown in Figures 1(a) and (b) at temperatures of 
25X and 39°C, respectivdy. The data were 
acqtiiied in the absence of decoupling during 
acquisition, and the one bond "C-^H couplings are 
shown as frequency domain splittings in the pro- 
ton dimension. The magnitudes of these couplings 
and the corresponding errors extracted using a 
Bayesian parameter estimation method (Andrec & 
Prestegaid, 1998) are ^wn on the spectra. As one 
would expect for the case of isotropic tumbling at 
25°C, all tfie couplings in Hgure 1(a) are similar 
and M in die range of typical Vch scalar coimliitg 
constants for non-anomeric sites in carlx>hydrates 
(140-150 Hz). On the other haiul, couplings 
measured at 39 °C are more widely spread because 
of the presence of a residual dipolar contribution. 
Residual dipolar contributions for individual C-H 
bond vectors calculated from the differences in 
couplings measured at 25**C (isotropic) and 39 °C 
(aligned) are shown in Table 1 {D^. 
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Figure 1. (a) Section of a proton-coupled HSQC spectrum of isotropic AMM in MBP with selected couplings anno- 
tated, (b) Section of a proton-coupled HSQC spectrum of oriented AMM in MBP with selected couplings annotated. 
The anomeric regions of the spectra are not shown but show splittings of 169.4(±0.5) Hz for isotropic condition and 
158.6(±0.2) Hz for the oriented case. 
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Table 1. Residual dipolar couplings for bound and free AMM 



Data Set 



CrH, (Hz) 



Cj-Hj (Hz) 



C3-Hj (Hz) 



C4-H, (Hz) 



AMM + MBP 4- hkeWe (O^ 
AMM -¥ bicelle (O^^) 
AMM-bound state (0^^^) 



-10^±0.7 
-3.5ib7 
-22.6±2^ 



-15.2±0.7 
-73±07 
-287±2J 



9.2±0.9 
20±33 



16.6±1 

12.1±07 

23.9±2.9 



C5-H, (Hz) 



1M±1.1 
7J±03 
303±3.0 



While residual dipolar coupling contributions 
are indeed observed in the AMM/MBP complex 
disscrived in a bicelle medium, they need not orig- 
inate fuDy from binding of AMM to a strongly 
^^Hffne^ We see only a single set of 

^^^-^ * ^ ^ but for a weak binding ligand in fast 
duit spends a portion of its time bound 
m and remainder free in solution, 
^ fliito wffl always be the case. Thus, observed split- 
tings will be a population weighted average of 
those induced in the free and bound states. 
Residual dipolar couplings have previously been 
observed in finee carbohydrate molecules dissolved 
in biceUe media (Bolon & Prestegard, 1998; Kiddle 
& Homans, 1998; Rundlof et al, 1998). WhOe small, 
diley can be siginificant. In order to quantify the 
contribution from the AMM-£ree state, the same 
experiments were repeated under identical con- 
ditions in the absence of MBP. The differences in 
measured couplings between 25 "C and 39 *C are 
also shown in Tabte 1 {Df„J. Again, couplings and 
associated errors were extracted using a Bayesian 
parameter estimation method. It is noteworthy that 
tfte magnitudes of couplings for various C-H bond 
vectors are generally smaUer for free AMM 
samples, but die reduction varies from site to site. 
The different pattern of splittings assures us that 
we are' not looking at a simple scaling of bicelle-' 
induced order and that MBP has a significant 
direct e^ect on both extent and direction of AMM 
aligrmient. This direct binding effect was con- 
firmed in another set of experiments. Here, a molar 
ratio of 1:8 MBP: AMM was used, which would 
reduce the bound AMM state to approximately 
8%. Indeed, in this case where indirect effects 
would domiruite, the presence of MBP had little 
measurable effect on the observed residual dipolar 
couplir^ 

Interpretation of residual dipolar contributions 
of a ligand in a protein-bound state will, in gener- 
al require separation of the contribution fi^om the 
free state using known binding properties. The 
observed residual dipolar coupUngs measured in 
the AMM/MBP complex (Table 1, will be a 
population weighted average from the free {Nf^) 
md bound (Wbound) states, such that 

Dobs = WfreeDfree + NboundDbound (1) 

fraction of AMM free and bound (Nf^ and 
:) can be calculated from the dissociation con- 
arui known protein-ligand concentrations, 
a previously determined binding constant 
1 mM (Sayers, 1998), approximately 40% of 
is in fact bound to MBP leaving approxir 



mately 60% of AMM free in solutioa Residual 
dipolar couplings originating from the bound state 
(Abound) calculated fiom equatldn 0) are shown In 
Table 1. Since these dip(rfaf IS^uiAb^ originate 
from MBPs partial &ffmBDm^ 
desired orientational infwffii^ih^^ 
to MBP. This information isi tNst extractad t^lng an 
order matrix analysts. ^: ; ■ ■ 



Order tensor calculation in AMM 

Using AMBER-nunimized stmcture coordinates 
for AMM (Woods et d,, 1995) and five residual 
dipolar couplings measured in the sugar ring in 
simple bicelle solution, the five dements of the 
order matrix were determined usirtg a singular 
value decomposition approach (Losonczi et al., 
1999). The initial moleodar coordinate frame for 
AMM was defined by placing the y-axis along the 
Cl-Hl vector and die z-axis normal to the plane 
defined by Hl-Cl-OMe. Using this frame, the pos- 
ition of the principal order frame axes detemuned 
from residual dipolar couplings from free AMM 
i^fne* Table 1) is illustrated in Figure 2(a) iising a 
Sauson-Flaumsteed projection ^ugayevskiy & 
Snyder, 1995). The aligiunent tensor is extremely 
asymmetric and the directions of all three principal 
axes of the alignment tensor are well defined, 
except that inversion of any axis is allowed. The 
asymmetry is quantitatively defined by an asym- 
metry parameter eta (ti) that can be written in 
terms of order parameters S^- (r| = ($„ - S^/S^. 
It is equal to 0.8(±0.13). Asymmetnc behavior is 
not unusual and has, in fact, been observed in 
many systems dissolved in bicelle media. Due to 
the high level of asymmetry, the direction of high- 
est order (S^J can alternate l>etween two orthog- 
oruil orientations (Figure 2(a)) for which S„ 
assumes values similar in magnitude but opposite 
sign (S^ = -h 0.0005 or -0.0005). Note tfiat one of 
the possibilities for the higher level of order is 
nearly along the x-axis of the initial molecular 
coordinate frame (the center of the Sauson-Flaum- 
steed plot). Hence, it is nearly in the plane of the 
Hl-Cl-OMe fragment and perpendicular to the 
C-H vector. 

The five elements of the order tensor for AMM 
determined from couplings for the MBP-bound 
state (Dbound) were also determined using a singu- 
lar value decomposition method, and the direc- 
tions of the principal axes are depicted in 
Figure 2(b). While the position of the z-axis, the 
most strongly ordered axis, is weiU defined, the 
position of die and Syy axes are ambiguous. 
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FiguK 2. (a) Sauson-Flaumsteed projection of the 
direction s of highest order for oriented AMM without 
MBP. (b) Sauscnt-Flaumsteed projection of the directions 
of highest order for oriented AMM in the presoice of 
MBP, witii a dear depiction of axial symmetry. The 
molecular coordinate frame for AMM was defined by 
placing tiw y-axis along tiw Cl-Hl vector and the z-axis 
ncHmal to the plane dcdoned by Hl-Cl-OMe. 



This is a characteristic of an axially syinmetric 
order tensor. The asymmetry parameters confirms 
this axially symmetric nature of the aligrunent ten- 
sor with Ti = 0^4(±0.16). In addition, the level of 
order (S„ = - 0.0012(±0.00012)) is approximately 
2.5 times larger than that for AMM in free solution, 
and the direction of highest order is shifted 40" 
toward the molecular z-axis (out of the plane of to 
the Hl-Cl-OMe fragment). 

The direction of alignment for the boimd ligand 
is most useful when referenced to the alignment of 
MBP. We know that the direction of highest order 
(^zz) depicted in Figure 2(b) should coincide with 
the direction of highest order for MBP itself. Rotat- 
ing the molecular frame of the sugar to achieve 
coincidence then allows determination of the rela- 
tive orientation of AMM bound to MBP. In general, 
determination of the alignment from a protein is 
possible using . *H-'^N data as we have recently 
demonstrated for other systems (Fischer et al,, 



1999). However, careful consideration of MBFs 
structure, and in particular, its symmetry, allows 
us to omit this independent interpretation and 
indeed explain the appearance of axial symmetry 
depicted in Figure 2(b). 



MBP's alignment tenson a solution 
In symmatry 

The form of MBPv.twd Jn 
hoimHrunar wMi >a 



atudks it a 
axia and 




The computed 

b« faom ti|A^ tnMin^^ ^^'^^--^^ 
mm be an awn^ jovik diriditat that ate 
tdaled by MBP% aymmetiy axis. As has 

Ions been recognized and used in studies of snud] 
motocules (Sauper 1968) this averaging has a very 
specific effect on order matrices determined from 
averaged couplings; regardless of the nature of 
MBPb interaction with tibe bioelle medium, AMM's 
experimentallv determined direction of highest 
order must always point along die symmetry axis 
of MBP aiul the lesuJting order tmsor must always 
be axially symmetric (Al-Hashimi et al„ 1999). 

In the abserKe of any large confbrmatiorud flexi- 
bility aiul mobility in the bound state, the order 
parameters determined from botmd AMM (S^ = 
- 0.0012(±0.00012)) will reflect MBP's order par- 
ameter. The fact that the principal order parameter 
is negative indicates diat MBPs symmetry axis is, 
on average, perpendicular to the magnetic field 
and parallel with the normals of the bicelles in our 




Figure 3. Depiction of AMM in fast exchange with the 
three equivalent CRDs of MBP. The 3-fold axial sym- 
metry results in a direction of highest order along the 
rotor axis. Yellow spheres correspond to Ca^"^, black 
and red shperes to carbon and oxygen, respectively, of 
AMM, and MBP is represented by ribbon diagram. 
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medium (these order with their normals perpen- 
dicular to the magnetic field; Sanders et al, 1994). 
This is as one would expect if interactions with thw? 
surfaces of bicelles dictated order of MBP. Axial 
symmetry should also propagate to the bound 
ligand and this is exactly what we observe in the 
order tensor determination using residual dipolar 
attributions from MBP bound AMM as shown in 
Figure 2(b). The directioit of highest order, in 
Figure 2(b) fhouM tKmr point d^g the tymmetry 
asds of Mil^ As We dlsciiis below, Oie orientation 




^ No crystal ^ structure erf triniuedib with 
AMM in the binding site actuaUy exists. However, 
AMM in the site of a dimer of the homologous 
MPD<:, missing the helical stem exists, and AMM 
in the trimer can be positioned by overlaying a 
caibohydrate4>inding domain from tf\e dimer on 
the three domains in the trimer structure (PDB 
entries IKMB and IRDL, rmsd 0.63-0.78 A). The 
orientation of AMM, when positioned so that the 
experimentally determined symmetry axes coincide 
with that of MBP cannot, however, be made to 
overlap with the orientation of AMM modeled in 
this way. It is displaced by approximately 40*. 
There are several possible explanations for this 
ot)served deviation. First, our use of the trimer as a 
model may be in error. Instead, an equilibrium 
among monomer, dimer and trimer forms may 
exist in solution. This possibility is imlikely, since 
linewidths and" diffusional properties of the con- 
struct used in previous studies are consistent with 
a trimer (Sayers, 1998). Moreover, the experimen- 
tally determined axiaUy symmetric tensor is con- 
sistent with the expected 3-fold averaging. Second, 
the actual geometry of the trimer in solution may 
be different from what is depicted in the crystal 
structure. This is not out of the question; there are 
in fact X-ray structures of mutated MBPs in tri- 
meric aggregates which show substantial differ- 
ences in the relative placement of the CRDs, even 
though mutations are concentrated in loops at the 
extremities. For example, the helices near the tri- 
mer junction in PDB entries IKMB and lAFA are 
displaced from one another by more than a 1 A 
wfen the stems are aligned. Similarly, the presence 
of internal motion in each protein domain or in 
AMM in the bound state can lead to deviations of 
jJJ^ge orientations. Third, it may be that alternate 
geometries are possible. A second binding 
njvolving a second Ca^^, in each CRD has, in 
.Deen observed in crystals containing a high 
««ccharide (Ng et a/., 1996). If this site were 
|P^P«" model would have to include 
orientation of the sugar in this 
u^*«}f second and third cases would 
f™ly symmetric nature of the deter- 
but the direction of highest 




order will vary. A fourth possibility is that the 
binding geometry in the primary site can vary. 

In Figure 4 we show a geometry of AMM in the 
binding site of MBP (IKMB.pdb) that is consistent 
with our determined axis of synunetry. Using this 
refined structure, s-methyl n\annose was docked 
to the binding calcium of the CRD of MBP. Orien- 
tational constraints required pladng AMM's exper- 
imentally derived direction of highest order along 
MBP't a)d|v|p^naunetr^ SutMsequenUy. 

rotations timindp^ translaliliH along x, 

1*PlgSi4. 

^^^^ aDowied van der 
6 betweoi HO-3 and 
Glul93 of MBP. The toimerk Ibnn Wl in our 
modeling has rK> ilgand of AMM in tfie binding 
site and Glul93 coordirution of the calcium com- 
pensates for the absence of sugar hydnoxyl. Hence, 
it is quite likely that Glul93 moves on sugar bind- 
ing. MBP actually has a radier open binding site, 
in which coordination of the three and foiir oxygen 
atoms of the sugar to Ca^**" are the primary inter- 
actions and considerable variation in the actual 
geometry of coordination may be possible. 

An independent determination of MBPs order 
tensor woilld remove some of the imcertainties in 
the above discussion. First, this would allow deter- 
mination of the relative domain orientations of 
MBP in solution. This can be achieved by equating 
the observed direction of highest order as seen 
from any single domain to the symmetry axis of 
trimeric MBP (Al-Hashimi et a!„ 1999). This pro- 
cedure, in fact, introduces a novel and simple route 
into determination of relative domain orientations 
in certain types of symmetry related homo-multi- 
mers such as MBP, a problem that so far has been 




Figure 4. Proposed mode of binding of AMM in MBP. 
The dark blue corresponds to Ca^"^, and the light blue 
represents residues in the binding pocket of MBP. 
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very difficult to address using standard NMR tech- 
niques. Second, the determined direction of highest 
order and the comw^wding order parameters wlU 
haw absorbed any Internal motional averagirig 
corttHbutkm within MBP pcrmittli^ direct com- 
parhon whh an or^lewor determined for AMM 



dUortytated a-melhyl mannoside. Flash chromatography 
followed by dcacctylalion %^th NaOMe/McOH pro- 
vided AMM in an 20% oversU yield. 



NMR spadroaeopy 



Two Mfnpbt wmw ptvparvd: (1) • t mM MBI'-A. 




ir\ -.•5 ,■. 





"il^Snment 



Preparation of MBP-A 

MBP-A is an expression product from the pIN-IDoin- 
pA-2 pUsmid (Drickamer, 1989), and was expressed as 
described (Weis et 1991). Briefly, competent JA221 
cdb wefe transformed with DNA plasmid and colonies 
selected from LB-amp plates for use in a starter culture 
in an LB-amp medium. Starter culture was grown to sat- 
uration ovemi^t at 37 "^C and used to inoculate five 
liters of LB-amp medium, which grew to A^so of 0.8. 
MBP-A production was then induced with IPTG and 
additional growth allowed for 2.5 hours. Cells were har- 
vested by centrifugation at 4000 g, the pellet was resus- 
pended in 10 mM Tris-Cl (pH 7.0) and then lysed by 
sonication. The insoluble pellet from 18,000 g centrifu- 
gation was solubulized in 6 M guanidinium chloride, 
0.1 M Tris-Cl (pH 7.0), and clarified by centrifuging at 
138,000 g for 30 minutes at 4*'C. Dialysis of the centri- 
hige extract with 25 mM Tris-Q, 1.25 M NaQ, 25 mM 
Caa2 over 48 hours and centrifug^g at 138,000 g pro- 
vided the dialysate which was purified by affinity chro- 
matography on a mannose-Sepharose columi\. Elution 
with 25 mM Tris-Q, 1.25 M NaQ and 2.5 mM NajEDTA 
followed by reconstitution by dialysis agaiittt 10 mM 
NaQ, 1 mM Tris-Q, 25 nr\M CaQa afforded 10 mg of 
MBP-A from a five liter growth culture. 

Preparation of AMM 

AMM was synthesized from 1*^6) glucose by minor 
modification of a described method (Hare et al, 1993). 
Briefly, tmiformly '^-labeled t>glucose was protected as 
the penta-acetate with acetic anhydride, and the crude 
material brominated with HBr/AcOH. Reduction with 
Zn/aqueous AcOH provided the desired tri-O-acetyl-D- 
glucal, which was used in a Ferrier reaction with metha- 
nol/E)owex H"^ to give (a-methyl-2>dideoxy-4,6-di-0- 
acetyl-D-glucopyranoside. Purification by silica gel 
chromatography; followed by treatment of the anhyd- 
rous material with OSO4 in pyridine afforded the desired 



_ normal 

^ fiwffiii itpiMiluin oc^iercnoe 

Inodified 90 ttuS>% coupliitgs 

mt^^^ d^ paiiSxi dimen^cxi. Quadrature 
In the ir w accon^Ushed 
nsing gradient cctoence sdectknv For all experiments, 
1024 points per scan, 16 scans per increment 160 f, incre- 
ments were acquired witfi a direct sweep widtt\ of 
3000 Hz and an indirect sweep widtii of 9000 Hz. Usir\g 
the same acquisition parameters, another identical HSQC 
experiment was acquired at a temperature of 25 "C 
where the biceUe medium is in an isotropic solution 
state. 

Calculations of dipolar couplings and order tensors 

Dipolar couplings were calculated as the difference 
betweei the oriented coupliixgs 0/ch + ^ad ^ iso- 
tropic couplings (Vch)- In each case, couplings were 
extracted using a Bayesian time-domain NMR parameter 
estimation program Xrambo, using the method described 
(Andrec & Prestegard, 1998). This program 'is available 
on the Internet at http://tesla.ccrc.uga.edu. Typically, a 
2D HSQC data set with the desired spiittiiigs in the 
direct dimension was transformed and phased to yield a 
1024 X 256 real matrix Single FID slices across Ae width 
of the resonarKe uruler investigation were aiudyzed inde- 
pendently. For every slice, the restilting frequency 
domain data were reverse Fourier transformed to gener- 
ate a ID time domain data set as ir^ut for Xrambo. The 
following model was used for tiie data. Each component 
of the doublet resulting from C-H couplings was given 
an identical linewidth and intensity, but an independent 
phase to circumvent any problems resulting from the 
presence of phase twist anomalies ox dispersive contri- 
butions. Values for shifts, linewidths, phases and intensi- 
ties were estimated and entered as starting parameters 
which were subsequentiy refined by Xrambo's Metropo- 
lis Monte Carlo method. This procedure was repeated 
for several indirect slices of the same doublet and the 
resulting rmsd across various slices was used as the 
uncertainty in measurement. The error analysis from this 
procedure was used to estimate the final precision of all 
measured residual dipolar couplings. The measured 
residual dip>olar coupl^gs and associated uncertainties 
along with AMM input coordinates obtained from an 
MD simulation were then used as input to a singular 
value decomposition program for the determination of 
order tensor elements (Losonczi et al, 1999). This 
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program is also available on the internet at http:// 
teslaxorc.uga.edu. 

Molecular dynamics simulation 

MD simulations wen? performed with AMBER 
4.1 (Pearlman et «/./ 1995) employing the all-atom 
GLYCAM93 parameter set for oligosaccharides (Woods 
If fl/v on a 12 processor Sd Origin 2000 comp- 
iiter. AMM was solvated in an approximatdy 
d2 A|( 32 A x 32 A box of 466 T1F3P water molecules 

iipRn^ed pair mteractions (tf 8^ A. IMai 
lit^^llii; were assigned at 5 Kr and the simu* 
j&l^pu^?^^^^^ at 1 atm. Selected torsions were 
WSKi^^aik^ CARNAL module of AMBER 4.1 hm 
the tnf^^ determine a dosdy related family of 
the most populated structures. 

Ooddng atudlae 

Wst 'ijppib^ of AMM's position in MBP were 

cim^MWi MIDAS Phis (Ferrin et al., 1988). Briefly, MBP 
with AMM modded into CRD were aligned witii tfte 
helix paraJki with the 2-axis. A second AMM was 
matched aikl oriented according to the singular value 
decompositioh analysis of the ^culated dipolar coup 
bngs (a 2 - 10% y + 40" rotation from fhe crystal struc- 
ture). This was then translated in x, y, and z directions 
while linutii^ rotations to the z-axis only. Solutions that 
maintained the three arui four OH groups 2.6-2.8 A from 
the Ca"^^ responsible for binding and that minimized van 
der Waals interaction with the protein were deemed 
plausible solutions. 
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